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ABSTRACT 


This  study  is  concerned  with  the  limiting  distributions  which 
arise  from  a  stochastic  learning  model  proposed  by  Bush  and  Mosteller  and 
discussed  in  "Stochastic  Models  for  Learning"  [ 1 ] .  A  summary  of  their 
work  devoted  to  the  class  of  learning  processes  with  experimenter-controlled 
events  is  given  in  the  first  chapter  of  this  thesis.  For  certain  special 
cases  of  the  Bush-Mosteller  model,  the  limiting  distributions  have  been 
found.  The  methods  used  in  determining  them  are  discussed  in  Chapter  II. 

In  the  third  chapter,  some  new  results  are  obtained.  For  processes 
involving  two  experimenter-controlled  events,  a  formula  for  the  raw  moments 
of  the  limiting  distribution  is  obtained,  and  a  method  is  developed  by 
which  the  limiting  distribution  can  be  determined  for  any  given  member  of 
a  special  class  of  these  processes.  This  procedure  is  generalized  to  apply 
to  processes  with  t-experimenter-controlled  events  (t=2,5»»->-)  • 
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CHAPTER  I 

A  STOCHASTIC  LEARNING  MODEL 

In  a  historical  review  of  the  mathematical  theory  of  learnings, 

Bush  [2]  suggests  that  the  development  of  the  present  structure  of  the  theory 
began  in  19^9  when  several  independent  research  programs  were  initiated.  As 
a  result  of  these  investigations,  several  learning  models  were  formulated  and, 
of  these,  most  are  either  of  the  "stimulus -samp ling"  or  "stochastic-response" 
type.  The  first  of  these  refers  to  models  in  which  are  incorporated  a  repre¬ 
sentation  of  the  stimulus  environment  of  an  experimental  subject  and  some 
means  of  describing  the  process  by  which  environment  influences  behaviour. 

The  assumptions  made  in  such  models  lead  to  theorems  concerning  the  way  in 
which  the  response  probability,  as  an  index  of  learning,  is  transformed  from 
one  trial  to  another.  Stochastic  learning  models  assume  a  set  of  such  trans¬ 
formation  rules,  and  are  concerned  with  predicting  their  consequences  to 
learning,  as  measured  by  response  probabilities.  In  Bush  and  Estes'  "Studies 
in  Mathematical  Learning  Theory"  [3]  both  types  are  discussed  and  several 
specific  models  examined  and  compared. 

One  of  the  earliest  learning  models  of  the  stochastic-response 
type  is  that  proposed  by  Bush  and  Mosteller  [4],  It  is  assumed  that  the 
transformation  of  response  probabilities  is  carried  out  by  the  application 
of  linear  operators.  Much  of  the  work  done  jointly  either  by  these  authors 
alone  [4],  [53 »  [6]  or  with  Thompson  [7]  is  contained  in  their  publication, 
"Stochastic  Models  for  Learning". 

The  present  chapter  of  this  thesis  provides  an  introduction  to  the 
learning  model  of  Bush  and  Mosteller  and  summarizes  those  results  which 
prove  useful  in  later  chapters.  Section  one  describes  the  model  and  some 


. 
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properties  of  the  mathematical  devices  upon  which  it  is  based.  A  description 
of  the  specialized  model  which  forms  the  subject  of  this  thesis  appears  in 

section  two.  Finally,  in  section  three,  a  generalization  is  suggested  to 
which  some  of  the  results  of  the  previous  section  are  easily  extended. 


SECTION  ONE 


The  model  discussed  in  this  thesis  describes  the  learning  process 
of  an  organism  or  subject  in  terms  of  the  responses  and  outcomes  occurring 
in  each  of  a  succession  of  trials  in  which  the  subject  chooses  to  make  exactly 
one  of  several  responses  to  a  given  stimulus.  The  term,  "learning",  as  it  is 
used  here,  will  refer  to  a  systematic  change  in  an  organism's  behavior. 


In  general,  a  set  of  r  mutually  exclusive  and  exhaustive  response 
classes,  A^,  A^,  ...,  A^,  is  considered  and  a  trial  defined  as  an  oppor¬ 
tunity  for  the  selection  of  exactly  one  from  among  these  r  possible  alter¬ 
natives.  For  a  given  trial,  there  is  associated  with  each  A^  (i  =  l,2,...,r) 
a  probability,  p^,  of  its  being  chosen.  Since  these  response  classes  are 

mutually  exclusive  and  exhaustive, 

r 


(1.1.1) 


and 


0  < 


Pj< 


It  is  assumed  that  on  a  given  trial,  a  response,  A^ ,  will  have 


0  ,  of  mutually  exclusive  and 
s 


an  outcome,  0^,  from  a  set,  0^,  0^,  ... 
exhaustive  outcomes  and  that  the  occurrence  of  this  response-outcome  pair, 


(A  ,0,  ),  will  alter,  in  a  unique  manner,  the  set  of  r  response  probabili- 
J  k 


ties  for  the  succeeding  trial. 


More  generally,  it  may  be  assumed  that  the  set  of  response  prob¬ 
abilities,  p1#  p  ,  ...  pr. 


are  changed  from  trial  to  trial  in  a  unique,  way. 


■  -- 

. 
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by  the  occurrence  of  one  among  a  set  of  t  events,  E^E^,  Et ,  and 
that  the  way  in  which  this  change  occurs  is  independent  of  earlier  events 
in  the  succession  of  trials. 


For  our  purposes,  it  will  be  sufficient  to  consider  the  case 
r  =  s  =  2  in  the  subsequent  discussion.  The  learning  experiment,  with  this 
restriction,  may  be  summarized  as  follows! 

During  a  given  trial,  a  subject  selects  one  and  only  one  of  two 
responses  and  with  probabilities  p  and  1  -  p,  respectively. 

For  either  choice,  exactly  one  of  the  outcomes  0^  and  0^  occurs.  Finally, 
the  probabilities  of  response,  p  and  1  -  p  ,  are  changed  from  trial  to 
trial  by  the  occurrence  of  events,  E^,  E^,  . . . ,  Et  in  a  way  which  is 
independent  of  the  previous  occurrence  of  these  events. 

The  model  uses  the  response  probabilities,  p  and  1  -  p,  as  an 
index  of  behavior  and  changes  in  behavior  are  represented  by  changes  in  these 
probabilities.  We  define  the  probability  vector  p  as 


where  p  and  q  are  the  probabilities  of  responses  A^  and  A  ,  respect¬ 
ively,  and,  by  condition  (1,1, l),  p  +  q  =  1.  It  is  assumed  that  changes  in 
these  probabilities  due  to  the  occurrence  of  event  E^(i  =  l,2,...,t)  may 
be  represented  by  the  application  of  a  two  by  two  matrix  operator,  T^,  to 
p  where  T.  is  of  the  form 

—  —l 


u 


11,  i 


-  U21,i 


u 

u 


12,  i  - 
22,  i  - 


(i=l,2,...,t) 


That  is,  the  new  vector  of  response  probabilities  is  given  by 


. 


(i=l,2,...,t)  . 
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T.  £  = 
—1 


r 


u1  .  , 

11,1  12, i 


L  U21,i  U22,i  J  L 


P 

q 


-  r 


Ull,iP  +  u12,iq  1 
U21,iP  +  U22,iq 


The  new  probability  of  Ax  is  uu  +  u12  and  of  A2>  u21  .p  +  u£2  ^ 

By  condition  (1,1,1),  therefore. 


(ull,iP  +  U12,iq)  +  (u21,iP  +  U22,iq)  “  1 


or 


(uu,i  +  U21,i)p  +  (u12,i  +  U22,i)q  =  1  • 


Since  this  is  true  for  all  p  and  q  satisfying  0  <  p  <  1,  0  <  q  <  1, 

and  p  +  q  =  1,  then,  in  particular,  for  p 


+  q  =  1,  then,  in  particular, 


=  1  and  q  =  0, 


Ull,i  +  U21,i  1 


and  for  p  =  0,  q  =  1, 


U12,i  +  U22,i  ’  1  ’ 


That  is,  u  =  1  -  us  ,  u22>.  =  1  -  u12;i,  and 


*1  = 


1"U21,i  U12,i 


L  U 


21,  i 


1-u 


12, i' 


( i— 1,2, ...» t) 


Let  b.  =  u01  .  and  a.  =  u,_  .  so  that 
l  21,i  i  12,i 


r  1-b 


T  = 
~i 


1-a. 

l 


( i=l»2, . . . , t) 


Then,  in  this  new  notation, 


-(l-bi)p+aiq  - 

r QiP  i 

£  = 

_biP+(l-ai)q 

lVJ 

) 


, 


where 
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(1.1.2) 

QiP  =  (l-bi)p  +  a±q 

^q  =  b^p  +  (l-ai)q  (  i=l ,  2,  .  .  .  ,  t )  . 

If  we  let  *  1  -  a  -  b^  and  a^  =  (l-a^JV  ( i=l , 2, . . .  ,  t )  then,  by 


substitution 

and  suitable  rearrangements  of  the  right  hand  side  of  equations 

(1.1.2),  the 

following  forms  of  Q^p  are  obtained: 

a) 

SLOPE  INTERCEPT  FORM: 

QtP  =  *±  +  a.p  ( i=l , 2, . . . ,t) 

b) 

FIXED  POINT  FORM: 

QiP  =  aiP  +  ^1“ai^i  (i=l,2, . . .  ,t)  . 

A.  is  the  fixed  point  of  this  last  transformation  because  Q^p  =  p  if 
and  only  if  p  =  A  .  For  either  form, 


(1.1.3) 

\q  =  1  ■  Qip 

by  condition 

(l.i.i). 

In  order  that  Q^p  ( i=l,2, . . . , t )  satisfy  the  conditions 

0  <  QiP  <  1 

for  all  p  satisfying  0  <  p  <  1,  it  is  necessary  to  impose  restrictions 

on  the  parameters  a  ,  b  ,  a.,  and  A..  Since  q  =  1-p  and,  from  equations 

i  l  i  i 

(1.1.2),  QtP  =  ai  and  C^p  =  1-b  ( i=l ,2, . . . , t)  when  p  =  0  and  p  =  1, 

respectively,  the  condition  that  Q^p  be  in  the  closed  interval  [0,1] 


implies  that 
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(1.1.4)  0  <  a.  <  1 

~  i  — 

and 

(1.1.5)  0  <  b.  <  1  (1=1,2,... ,t)  . 

These  conditions  are  sufficient  since  Q^p  =  (l-b^)p  +  a^q  attains  its 
maximum  value  when  b.  =0,  a  .  =  1 ,  and  Q.p  =  1,  and  its  minimum  value 

l  l  l 

when  b^^  =  1,  a^  =  0,  and  Q^p  =  0  ( i=l ,  2, . .  .  ,  t ) . 

Because  a..  =  1-a^-b^  by  definition,  it  follows  immediately  from 
(1.1.4)  and  (1.1.5)  that 

”1  51  ai  ^1  (l=l»2,...,t)  . 

If  cu  >  0,  Q.p(=^i+a^p)  attains  its  maximum  value  when  p  =  1  and 

Q^p  =  a^  +  a^.  This  implies  that 

(1.1.6)  cc^  ^  I  “  (  i=  1 , 2 , .  . .  ,  t )  . 

On  the  other  hand,  if  <  0,  Q^p  attains  its  minimum  value  when  p  =  1 
and  Q^p  =  a^  +  and  this  implies  that 

(1.1.7)  ^  cc,^  ( i=l » 2, .  .  .  , t )  . 

Inequalities  (1.1.6)  and  (I.I.7)  combine  to  give 

(1.1.8)  -a^  <  (x^  <  1  -  a^  (  i=l ,  2, . .  . ,  t )  . 

Finally,  since  X  is  the  fixed  point  of  the  transformation 

QiP  =  a^P  +  (l-ai)Ai  , 

and  0  <  p  <  1,  it  is  required  that  X  satisfy  the  conditions 

0  <  A.  <  1 


(1.1.9) 


(i=l,2,...,t)  . 


' 
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In  the  subsequent  discussion,  the  further  restriction  that 

(1.1.10)  a.  >  0  ( i=l ,2, . . . ,t)  , 

will  be  imposed  to  overcome  practical  difficulties  associated  with  negative 
values  of  these  parameters.  For  example,  the  oscillatory  character  of  a 
sequence  of  response  probabilities  generated  by  successive  applications  of 
a  given  operator  with  a  negative  slope  parameter  is  unrealistic  in  most 
learning  experiments. 

Condition  (1.1. 10)  implies  that  a^  +  b^  <  1  ( 1=1,2, ...» t) .  The 

revised  sets  of  restrictions  are  given  below  with  their  associated  forms: 

I  SLOPE  INTERCEPT  FORM: 

QtP  =  a  +  cup 

(1.1.11)  0  <  a.  <  1,  0  <  a.  <  1-a.  (i=l,2, . . . , t) 

II  FIXED  POINT  FORM: 

QiP  =  aiP  + 

(1.1.12)  0  <  A.  <  1,  0  <  a.  <  1  ( i=l ,2, . . . , t)  . 

REPETITIVE  APPLICATION  OF  A  SINGLE  OPERATOR,  Q 

In  the  fixed  point  form  of  equation  (1.1.12),  the  operator  (with 
the  restrictions  that  0  <  cu  <  1  and  0  <  A^  <  l)  is  applied  to  p 
whenever  the  event  E^  occurs  ( i=l,2, . . . , t ) .  After  occurs  twice  in 

succession,  the  probability  of  response  A^  is 

Qfp  =  Qi(Qip)  =  ai[aiP  +  (l-ai)Ai]  +  (1-cu^ 

=  a?p  +  (l-a?)Ai  (i=l,2,  —  , t)  . 


4 
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This  suggests  the  induction  hypothesis, 

Q?P  =  cup  +  (1-apV  (i=l,2, . . . ,t)  . 

Under  this  hypothesis, 

_n+l  (rn  \ 

Qt  P  =  Qj^CQiP) 

=  a^p)  +  (l-ai)Aj, 

=  a^^Ca^p  +  (l-a?^]  +  (l-a^A^ 

n+ 1  / .  n+ 1  \ 

-  ai  P  4*  (  '  i  * 

Therefore,  if  occurs  n  times  in  succession,  the  probability  that 

response  occurs  is 

(1.1.13)  Q"p  =  a“p  +  (l-a")\  (1=1,2 . t) 

after  the  last  occurrence. 

Now,  when  0  <  cu  <  1 

lim  Cj"p  =  A  (1=1,2 . t)  , 

n  — ►  oo 

and,  therefore,  the  fixed  point  A^  is  the  limit  point  of  the  operator  Q^. 

In  other  words,  if  the  same  event  (from  among  the  set,  E^,  E  ,  ...»  Et) 
occurs  repeatedly,  the  probability  of  response  stabilizes  at  some  fixed 

value.  Moreover,  equation  (I.I.I3)  provides  a  means  by  which  this  response 
probability  may  be  computed  after  a  given  number  of  repetitions  of  this  event. 

Figure  1.1  illustrates  the  effect  of  the  repetitive  application  of 
an  operator  Q_^  on  p,  where  p  =  0.1,  A^  =  0.8  and  =  0.9« 
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Trials,  n 

Fig.  1.1  Values  of  Q^p  plotted  against  trial  number  n. 

Q?P  =  a^P  +  (l“ajj)\  with  p  =  0.1,  ^  =  0.8  and 

=  0.9  were  used  in  plotting  the  curve. 

(Reproduced  from  [1],  page  60) 


SECTION  TWO 


The  model  described  above  is  applied  to  three  categories  of  learn¬ 
ing  experiments,  namely,  those  involving  experimenter-controlled  events, 
those  with  subject-controlled  events,  and  those  for  which  the  events  are 


experimenter-subject-controlled.  The  events  E^,  E^,  ...,  Et  are  said  to 
be  experimenter-controlled  if  they  occur  with  fixed  probabilities  jt0, 

...»  Jtt>  respectively,  chosen  by  the  experimenter.  If,  in  the  case  t  =  2, 
we  let  the  event  E^  be  the  choice  of  response  A.^  and  the  event  E0  be 
the  choice  of  response  A^,  these  events  are  subject-controlled.  Finally, 
if  outcomes  0^  and  0^  occur  with  fixed  probabilities,  and 

respectively,  chosen  by  the  experimenter,  and  we  let  E^  be  the  occurrence 
of  response  A.  and  outcome  0.  (i,j,  =  l,2),  the  events  E^,  E^,  ^l’  an<i 


j 


E  are  called  experimenter-subject-controlled  events.  Learning  processes 
involving  experimenter-controlled  events  are  the  subject  of  this  thesis. 


■ 


10 


An  example  is  provided  by  the  Brunswick  T-maze  experiment. 

Suppose  that  on  each  of  a  succession  of  trials  a  hungry  rat  is 
placed  at  the  base  of  an  elevated  T-maze  and  allowed  to  run  down  the  maze 
to  a  choice  point  where  it  either  turns  left  or  right.  On  each  trial,  food 
(the  reward)  is  placed  in  a  box  at  the  right  or  left  side  of  the  T. 

On  each  trial,  then  the  rat  is  given  the  opportunity  to  choose 
between  a  right  and  left  turn,  and  either  choice  is  followed  by  exactly  one 
of  the  two  outcomes,  reward  and  non-reward.  Suppose  it  is  assumed  that: 

(1)  Turning  right  is  rewarded  a  fixed  proportion,  of  the 

trials  when  the  rat  went  right  and  turning  left  is  rewarded  a  fixed  propor¬ 
tion,  =  1-jt^,  of  the  trials  when  the  rat  went  left. 

(2)  A  rewarded  right  turn  and  an  unrewarded  left  turn  have  the 
same  effect  on  the  probability,  p,  of  a  right  turn  and  so  constitute  an 
event  class,  E^,  with  associated  operator,  Q^. 

(3)  Similarly,  a  rewarded  left  turn  and  an  unrewarded  right  turn 
constitute  an  event  class,  E^,  with  associated  operator,  Q^. 

Then,  the  probability  of  E^  is  pjt^  +  ( 1  - p ) ( 1  )  =  A}.  anc* 

(l-p)?^  +  p(I-jt^)  =  jt0,  so  that  the  conditions  of  the  general  definition 
are  met  and  this  is  an  example  of  a  learning  experiment  with  two  experiment¬ 
er-controlled  events. 


PRE -ASYMPTOTIC  DISTRIBUTION  THEORY 

Consider  the  case  of  two  experimenter-controlled  events,  and  hence, 
two  operators,  and  Q^,  which  are  applied  with  probabilities  and 

jt  .  Assume  that  +  *2  =  1  and  let  the  or§anlsm's  probability  of 

choosing  response  A1  be  p  .  At  the  end  of  the  first  trial  the  probability 


. 

■ 

, 


11 


of  response  is  with  probability  and  Q2pq  with  probability 

2  p 

3*2’  an<*  after  the  second,  is  Q^Pq  with  probability  jt^,  ^^l^o 

probability  Tiy  ^i^2P0  with  probability  Tt^ ,  and  Q^p^  with  prob- 
2 

ability  jt^.  This  process  carried  through  three  trials  is  illustrated  by 
a  tree  diagram  in  Figure  1.2. 


If  p  denotes  the  probability  of  response  A  on  trial  n,  p 
n  In 

has  one  of  2n  possible  values,  say,  p  , ( i=l , . . . , 2n) .  Let 


ni 


P  ,  =  P[p  =  p  J 
ni  Lrn  rni 


(Ul,2,...,2n). 


Then,  in  terms  of  a  large  population  of  organisms,  each  having  an  initial 

probability  p  of  choosing  response  A  P  ,  is  the  proportion  of  these 

0  1  ni 

organisms  that  would  have  a  probability,  p^ . ,  of  choosing  response  A^ 
after  trial  n  (i=l,2, . . . ,2°;  n=l,2,...). 

In  the  form  of  equation  (l.l.ll),  Q.p  =  a.+QLp  and  Q  d  = 

'  In  1  In  2  n 

a^+G^p^,  so  that  an  organism  with  response  probability  p  ,  at  trial  n, 

has  response  probabilities  Q.p  .  =  a^+aLp  .  and  Q_p  .  =  a_+C*Lp  .  with 
r  r  1  ni  1  1  ni  2  ni  2  2  ni 

conditional  probabilities  and  jt^,  respectively,  at  trial  n+1 

(i=l,2, . . . ,2n;  n=l,2,...).  Thus,  an  organism  at  trial  n+1  will  have 
response  probabilities  Q^p^  anc*  ^2Pni  probabilities  jt^P^  and 

Pni>  respectively  (  i=l , 2, .  . .  ,2n ;n=l , 2, . .  . ) . 


Let  p^  denote  the  expected  probability  of  choosing  response  A 
at  trial  n  (In  terms  of  a  population  of  organisms,  p^  is  the  average 
probability  of  response  A^  for  this  population  on  trial  n).  Then,  with 


Trial  No. 


0 


1 


2 


3 


Value  of  p 
Proportion  p 


P 

1 


Fig.  1.2.  The  successive  splits  that  a  large  group  of  animals  goes  through 


CVI  CVl 
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1’ 


after  three  events  when  the  probability  of  applying  operator  is  jc 

given  an  operator  is  to  be  applied  (experimenter-controlled  events). 

The  proportions  in  the  groups,  indicated  beneath  the  p  value  of  the  group, 

have  been  written  to  parallel  the  Q’s  rather  than  the  simplest  form-thus, 

under  Q0Q,Q0P>  we  write  nxx  rather  than  (Reproduced  from  [1], 

r,A\d  id  did  id 

page  70). 


where 


Let 


so  that 


„n 


P  =  )  P  •  P  .  >  and 

n  /  ,  m  ni 


i=l 


n+1 


,n+l 


i=l 

„n 


p  .  .  P  .  . 
n+ 1 ,  i  n+ 1 ,  l 


■  A  K“l4®lpni)*lPni+(“a4aapiii)*2Pni1 


i=  1 


.n 


.n 


=  ("iai+«2a2)  A  +  ("lal+*2ft2)  A  Pnifni 


i=l 


i=l 


=  a  +  a  Pn  , 


a  =  Jr1a1  +  and 


a  =  +  ^2a2 


a  =  (l-a)A  and 

Qp  =  a  +  a  p 


A+i  ■ 5  pn  -  ^n+lpo  ■  «  A  +  (1-^)A 


A  difference  equation  of  this  type  has  been  solved  and  its  solution  given 
in  equation  (l.l.l^).  Thus, 


p  +  (l-a*1)^ 
ro 


(1.2.1) 


-n 

a 


MOMENTS 


th 

Let  V  denote  the  m  1  raw  moment  of  the  distribution  of  the 
m,n 

response  probability,  p^,  where  P[p^  =  PnA  =  pnj[  and  an  organism  on 
trial  n+1  has  response  probabilities  Q^p^  and  ^2^ni  w^t^1  probabilities 

«i  Pni  and  jtg  Pni,  respectively  ( i=l , 2, . . . ,2n) .  Then, 


„n 


V 


m 


=  )  (P  Jm  p  vn  =  1»  and 
,n  ni7  ni  0,n 


i=l 


,n+l 


m,n+l 


A  (pn+l,A  Pn+l,i 


i=l 

_n 


■  /.  “lPni  +  (a2+a2Pni^m  *2  Pni] 


i=l 


2n  m 


V  V  /ms  m-k  k  /  %k 
■  *1  /,  /,  (k>  31  *1  Pni  Pni 


+  It. 


i=l  k=0 

n 

2  m 


i=l  k=0 


,mN  m-k  k  /  Nk 

(k>  a2  a2  (Pni5  Pni  ' 


Thus , 


(1.2.2) 


m 


m 


r,  /mv  m-k  k  tf  V  /mN  m-k  k 

V  i=  /  (i  )  ai  an  V  +  it  )  (,)  a  a0  V, 

m,n+l  1  k7  1  1  k,n  2  k7  2  2  k, 


n 


k=0 


k=0 


Thus,  V  ,  may  be  computed  from  the  first  m+1  raw  moments,  V  , .  .  .  ,V 
m, n+1  7  0,n  m, 

of  the  distribution  of  response  probabilities  on  trial  n. 


In  particular, 


* 


(1.2.4) 


where 


(1-2.5) 


Let 
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V2,n+1  "  l7Tial+3t2a2^  +  2^lalal+5t2a2a2^Vl,n 


+  U'lal+Jt2a2)V2,n 


=  CL  +  C.V,  +  C  V_ 

0  1  l,n  2  2,n 


C0  *lal  +  *2*2 


C,  = 


=  2(«1a1a1  +  Tr2a2a2^ 


C2  =  +  it2a2 


a  =  n1a1  +  *2a2 


a  =  rtlal  +  ,t2a2 


QP  =  a  +  ap  , 


and  define  V,  by  the  relation, 

1  ,oo 


(1.2.6) 


1 ,00 


1-a 


so  that  (1.2.3)  may  be  expressed  as 


-n+ 1 


V.  .  =  QV.  =  Q  V  _  =  aV.  +  (l-a)V. 
l,n+l  1 ,n  1,0  1 ,n  '  1, 


00 


The  solution  of  this  difference  equation  is  given  in  equation  (I.I.I3). 
Thus , 


(1.2.7) 


l.n  -  a“  vi,o  + 


00 


■  V1,»  -  Vl,0> 


_n 

a 


By  substituting  this  result  in  (1.2.4),  the  following  is  obtained: 
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( 1.2,8) 


?2,n+l  -  <VClVl,oo>  '  W  -  Vo)*”  +  C2V2,r 


Let 


so  that 


C'  =  cn  4-  C-V.  and 

0  0  1  l,oo 

c’  =  -cfv.  -  V,  ) 

1  1  1,00  1,0' 


(1.2.9) 


V2,u+1  ■  Ci  +  C1  ““  +  C2V2,n 


Since 


V2,0  =  'lo  =  P0  ' 

V2,1=C0+CI+C2P0  • 


Thus , 


and 


2,2  ■  c0  +  C1  a  +  C2(C0  +  C1  +  c2  P0> 

-  C^(l+C2)  +  C'(a+C2)  +  C2  p* 


.2  2. 


v2,5  “  co  +  CF  +  c2[c-(i+c2)+c;(a+c2)+c^] 

=  C^(l+C2+C2)  +  C|(a  +C2a+C2)  + 


These  results  suggest  the  induction  hypothesis, 

n-1  n-1 

(1.2.1°)  V2,n=CiI 

u=0  u=0 


CU  +  C' 
2  '  1 


_n»l-u  u  n  2 
a  C2  +  C2pQ 


Assuming  that  (1.2.10)  is  true  for  n  =  l,2,...,k  we  have,  for  n  =  k+1 
[and  using  (1.2.9) ]» 


IT  - 


2 ,  k+ 1  0 


=  ci 


>k 

1  a 


CU  4.  C' 
C2  +  °1 


u=0 


k-1 

k-1 

[ci  y  c 

l 

-1-u  u 

2 

o 

1! 

3 

c 

it 

o 

k 

V  -k-u 

_u  k+1  2 

1  a 

c  4-  C  n 

2  2  F0 

y 

u=0 

n  =  k+l 

in  (1.2.10). 

Hence 

is  true  by  induction. 


If  C2  /  1  and  a  /  C2  , 


n-1 

,  „n 

V" 

l 

1-C_ 
ru  2 

°2  ”  1-C, 

> 

u=0 

2 

n-1 

-] 

a 

v 

-n-1 -u  „u 

/ 

a  Cc, 

11=0 

d 

n  n 
“  °2 


C 


2 


and  hence, 


(1.2.11) 


1-C 


V 


2,n 


n  _n  n 

■P  a  "CP 

co  +  ci 


\  2 

>  +  °2  P0 


,n 


_n  n 
a  -u. 


1-C  ^  v 

=  (C0+C1V1#00)  Y_c2  -  Ci(Vi,oo"Vl,0)  CL  -  c'2  +  C2  P0 


Since  =  1, 


a  =  +  ^2^2  * 

r  2  2 

=  it  1<x1  +  it2a2  , 


and,  from  (1.1.12), 


0  <  at  <  1  (i-1,2)  , 


0  <  C  <  1 


0  <  a  <  1  > 


it  follows  that 


18  - 


and,  therefore,  that  lira  V_  exists.  Let 

*  n 


n  — *  oo 


V  =  lim  V 
2,oo  2,n 

n  ->  oo 


2  2 

and  <T  (=  V  -  Vn  )  be  the  limiting  variance.  Then. 
00  2,oo  1  ,oo/  ° 


Cn+CiVi 

J  _  0  1  1  ,00 

2,00  "  1  -  C 


and 


(1.2.12) 


0  C+C.V, 

n-2  _  _0  -,1 

«  "  i  -  c2 


-  V 


1  ,00 


ASYMPTOTIC  DISTRIBUTION  THEORY 


A  sequence  of  distributions  of  response  probabilities  has  been 
defined  in  association  with  the  sequence  of  trials  in  a  learning  experiment 
with  two  experimenter-controlled  events.  The  proof  of  the  existence  of  a 
limiting  distribution  for  this  case  and  the  "trapping  theorem"  stated  below 
are  due  to  Karlin  [8].  Both  results  are  quoted  from  [l],  Page  99  (where 
"p  values"  refer  to  response  probabilities)  and  the  proofs  are  omitted. 

ASYMPTOTIC  DISTRIBUTION  THEOREM.  For  two  response  classes 
and  experimenter-controlled  events,  it  has  been  shown  that 
an  asymptotic  distribution  exists,  that  is,  that  the  p 
value  distributions  on  trials  n  and  n+1  may  be  made  as 
close  to  one  another  as  desired  by  making  n  sufficiently 
large.  Furthermore,  the  distribution  is  independent  of  the 
initial  distribution  of  p  values  .... 

Karlin  [8],  page  727  states  that  in  a  subsequent  paper  the  theory 
of  [8]  will  be  adopted  for  more  general  models,  where,  for  example,  more 
response  classes  are  allowed. 


TRAPPING  THEOREM  FOR  r=2  ....  consider  two  event  operators 
having  unique,  limit  points  and  non-negative  a's,  that  is, 
operators  for  which  0  <  a.  <  1.  The  theorem  asserts  that 
if  the  p  value  of  any  sequence  is  ever  between  the  two 
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limit  points  the  sequence  forever  remains  there;  this 
part  of  the  theorem  is  intuitively  obvious ,  for  neither 
operator  can  take  a  p  value  outside  the  interval.  The 
theorem  also  asserts  that,  when  the  sequence  is  outside 
the  interval,  it  will  later  enter  the  interval  with  pro- 
bability  1,  provided  that  the  probability  of  application 
of  each  operator  is  never  zero.  For  more  than  two  event 
operators  with  unique  limit  points  and  non-negative  a's, 
the  same  statements  apply  to  the  interval  between  the 
largest  and  smallest  limit  points  .... 


References  to  this  theorem  will  be  made  in  the  course  of  the  subsequent 
discussion. 


The  first  and  second  raw  moments,  as  well  as  the  variance,  of  the 
limiting  distribution  have  been  determined  and  are  given  in  equations  (1.2.6) 
and  (1.2.12).  These  results  and  the  problems  to  be  discussed  in  the  succeed¬ 
ing  chapters  are  simplified  when  the  equal  alpha  condition,  which  arises 
when  a  symmetry  requirement  is  placed  on  the  events  E^  and  with  oper- 

•V 

ators  Q^p  =  o^p  +  (1-a^)^  and  Q2p  =  +  (l-a^)^,  is  imposed.  If 

Is  the  complementary  operator  of  so  that 


V  ■ 1  ■  V 

=  1  -  a2P  -  (1"a2^2 

=  agq  ♦  (l-a2)(l-^2)  > 


we  define  this  symmetry  requirement  in  the  following  way;  Q^p  is  the  same 
function  of  p  as  Q^q  is  of  q.  This  is  true  if  and  only  if 

al  =  a2 

and 


1  - 


=  A 


1 


or,  in  other  words,  the  symmetry  requirement  is  met,  if  and  only  if,  for 
operators  and  Q^,  the  slope  parameters  are  equal  and  the  limit  points, 


complementary . 


O  -  X  m  p 

,  (  *  = 

/  ■  •'  -  ! 

, 


20 


With  the  restriction  that  =  a0  =  a,  (1.2.6),  (1.2.5),  and 


(1.2.12)  become 


(1.2.13) 


Vl,»  =  "lAl  +  n2A2  ’ 


(1.2.14) 


c0  =  (1-a)  (*x\  -i-  *2^2) 


Clm  a°'(1  “  cc)Vi, 

2 


00 


C2  =  a 


and 


(1.2.15) 


respectively, 


r 


< 


V2,~  =  TTZ  ((l-a)(«x^+)t2^)  +  &V  ] 


^  1  ~  (X/  2  ^  2  .,2  \ 

<T2  =  (Vl  +  "2A2  -  V1,J  > 


00 


SECTION  THREE 


Consider  a  learning  process  with  t  experimenter-controlled 
events,  E^,  E^,  ...»  and  associated  operators,  Q^,  Q^,  ...,  which 

are  applied  with  fixed  probabilities  jt^,  tc  ,  ...,  respectively.  Then, 

if  p^  denotes  the  probability  of  response  on  trial  n,  p^  has  one 

of  tn  possible  values,  say,  p^  ( i=l , 2, . . . , tn) .  Assume  V"  ^ t  =  i  and 


let 


i=l 


P  .  =  P[p  =  p  .  ] 
ni  n  ni 


(i=1.2 . tn). 


If  p^  denotes  the  expected  probability  of  choosing  response  A^  at  trial 
n*  P0  =  P0’  n 


Pn  =  2,  Pm  Pni  ' 
i=l 


21 


=  a  +  a  p 


n 


y 


where 


a  =  j  7t^a^  and  a  = 

j=i  j=i 


tt  .a . 
J  J 


The  solution  of  this  difference  equation  is  given  in  (1.2.1)  and  hence, 


(1.3*1) 


Pn  =  a11  PQ  +  (l-an)A  , 


where 


t  h 

If  V  denotes  the  m  raw  moment  of  the  distribution  of  the 
m,n 

response  probability  p^  , 


j=l  i=l 


22 


By  using  the  binomial  expansion  and  interchanging  the  order  of  summation  over 
i  and  k,  the  following  is  obtained: 


m 


(1.3.2) 


m,n+l  A  j 


,itk  m-k  k  „ 

(k)  "j  aj  Vk.n 


j=l  k=0 


In  particular, 


L 

Vl,n+1  =  I  "j 


J-l 


=  a  +  a  v 


1  >n  * 


where 


and 


a  =  >  it .  a . 

A  J  J 
j=l 


a  =  7  it ,  a . 

A  J  J 


j-l 


This  difference  equation  has  been  solved  and  its  solution  given  by  (1.2.  7) 
Thus , 


(1.3.3) 


V  =  V  -(V  -  v  ) 

l,n  1,00  ^vl,oo  vl,<y 


_n 

a 


where 


l,oo  1  -  a 


Also , 


2,n+l 


I  ”j  (aj+2ajajVl,n+aK>n) 

J-l 


t  t 

=  I  "j  a2j  +  2(  I  *JaJ0j)Vl.n 
j=l  j-l 


+  U  ^  ja  j  /  2 ,  n 
J-l 


=  C0  +  C1  Vl,n  +  C2  V2,n  ’ 


(1.3.1*) 


•:  ■  •  • 


(s.t.i) 


' 


■ 
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where 


0 


j  =  l 


2 

n  .  a  ,  , 

J  J 


(1.3.5) 


t 

2 1  njajaj 
j=l 


j=l 


2 

it  du. 
J  J 


Substituting  V  =  V  -  (V  -  V  )a,n  in  (1.3. 4)  gives 

l  j  n  i  j  oo  i  y  oo  i  j  u 

V2,n+1  "  <C0  +  C1VX,J  -  Woo  -  Vl,0^n  +  C2V2>n 


with  solution,  given  in  (1.2.11), 


(1.5.6) 


l-C 


V2,n  =  <C0  +  C1V1,W 


'  W*>  -  YoW;  +  C2»0 


provided  that  /  1  and  a  /  . 


Since 


j=l 


t 

-  V1 

a  =  L  *jaj 

j=i 


C2  “  L  *jaj  ’ 

j=l 


and,  from  (1.1.12), 


0  <  a  <  1 


(i-l,2,...,t)  , 


it  follows  that 


o  <  c2  <  1 

0  <  a  <  1 


- 

-£> 


■ 


. 


and  hence,  that  V 


-  2k  - 


-  =  lim  V_  exists  and  is  given  by 

2 ,°°  „  2,n 

n  — ►  (» 


(1-5- 7) 


Cn  +  CiVi 

y  _  _0 _ 1  1  ,00 

2,00  "  1  -  C 


2  2 

The  limiting  variance,  <r  =  (=V^  -  V,  )  is 

“  2,oo  1  ,oo 


00 


(1-3-8) 


c r  8 

00 


Cn  +  CiVi 

0 _ 1  1  ,00 

1  - 


-  V 


1,00 
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CHAPTER  II 

A  REVIEW  OF  METHODS  USED  IN  DETERMINING 
LIMITING  DISTRIBUTIONS 


The  explicit  form  of  the  limiting  distributions  of  response  prob¬ 
abilities  for  processes  involving  two  experimenter-controlled  events  is  known 
for  only  a  few  special  cases  in  which  the  equal  alpha  condition  has  been 
imposed.  Most  of  these  known  limiting  distributions  have  been  determined 
by  Kemeny  and  Snell  [9]  and  McGregor  and  Hui  [ 10 ] .  The  methods  devised  by 
these  authors  and  the  results  which  have  been  obtained  by  them  are  reviewed 
in  this  chapter. 


Section  one  is  devoted  to  a  study  of  the  work  of  Kemeny  and  Snell 
who  consider  the  case  for  which  =  a,  =  1  and  A^  =  0;  that 

is,  the  case  involving  two  operators,  Q^p  =  ap  +  (l-a)  and  Q^p  *  ap  [It 
will  be  shown  in  Section  two  that  the  limiting  distribution  may,  by  a  suit¬ 
able  transformation,  be  made  independent  of  A^  and  A^].  ^or  t^s  case> 
it  is  shown  that  the  response  probability,  p^,  is  distributed  like  the 
quantity 


S  =  aDp  +  /  €.(l-a)a^ 

n  0  Z_j  J 

J-l 


where  pq  is  the  initial  probability  of  response  A^  and  the  (j=l,2,...) 
are  independently  distributed  random  variables  such  that  6,  =  1  or  0  with 
probability  and  itg,  respectively.  Then,  the  limiting  distribution  is 

the  same  as  the  distribution  of  the  quantity 


rtodt.  el  31 


-  26  - 


which  is  independent  of  p  .  By  considering  this  random  variable  and  a 
recursion  relation  for  the  raw  moments  of  the  limiting  distribution,  explicit 
forms  for  the  limiting  distribution  are  obtained  in  a  number  of  cases.  The 
characteristic  function  of  the  limiting  distribution  is  shown  to  be 


E(eiCS) 


00 

=  n 


j=i 


(*2  +  e1^1-^’1] 


McGregor  and  Hui,  whose  work  is  summarized  in  section  two,  obtain 
the  corresponding  characteristic  function  for  general  A^  and  A^  and  show 
that  by  means  of  a  transformation  of  the  limiting  variate,  it  may  be  made 
independent  of  A^  and  A^.  The  characteristic  function  so  obtained  is 
inverted  in  special  cases  to  yield  explicit  forms  of  the  corresponding  limit¬ 
ing  distributions. 


SECTION  ONE 

Consider  the  special  case  of  a  Bush-Mosteller  learning  process 
with  two  experimenter-controlled  events,  E^  and  E^,  for  which  A^  =  1, 

A^  =  0,  and  =  a^  =  a^  1.  Let  the  probabilities  of  E^^  and  E^  be 
and  respectively.  Then,  a  probability,  p,  of  response  A^,  becomes 

Q^P  =  ap  +  (l-a)  and  Q^p  =  ap  with  probabilities  and  jt^,  respect¬ 

ively. 


Let  fq  be  the  initial  probability  of  response  and  let  the 

random  variable  p^  be  the  probability  of  response  A^  after  trial  n. 

Then  p.  is  ap  +  (l-a)  or  aP  and  p  is  a  P  +  a(l-a)  +  (l-a), 

1  o  O  C.  o 

a2po  +  (l-a),  a2P0  +a(l-a)  or  a2p^  .  It  is  clear  that  the  possible  values 

of  p  may  be  written  as 
n 


.  ' 


. 
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n 


(2.1.1) 


anpQ  +  ^  °  j  ( l”a)a^  1  * 


j=l 


where  is  0  or  1.  Moreover,  is  1  if  and  only  if  is  applied 

on  trial  n-j+1  ( j=l,2, . . . ,n) ,  so  that  the  probability  of  a  particular 

value  of  p  ,  written  in  the  form  of  (2.1.1),  is  r  where  r  is 

n  12 

the  number  of  c.  which  have  value  1. 

J 


Thus,  the  distribution  of  p  is  the  same  as  that  of  the  random 

rn 


variable 


n 

Sn  =  anpQ  +  € j ( l_a)a^  1  (n=l,2,...)  , 

>1 

where  (j=l,2,...)  are  independent  random  variables  such  that  is 

1  or  0  with  probabilities  and  n  ,  respectively.  The  limiting 

distribution,  whose  existence  is  assured  by  the  asymptotic  distribution 
theorem,  is  the  distribution  of 


00 


(2.1.2) 


S=  £  €j(l-<x)a^ 
J~  1 


Let  F  be  the  distribution  of  S  .  Then, 
n  n 


and 


E[(s  i)^  Is  =  x]  =  jt  [ax  +  (1-a)]^  +  7t0[ax] 
n+ln  1  2 


E[(sn+1)J]  =  *j  J  [ax+(l-a)]'i  dFn  +  F  [ax]J  dFR 


=  n1  ^  (^)ak(1-a)J'k  E[S^]  +  *2aJ  E(S J]  . 
k=0 


(2.1.3) 


. 
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A  recursion  relation  for  the  raw  moments  of  the  limiting  distribution  is 
obtained  by  letting  n  tend  to  infinity  in  (2.1.3)*  Thus, 


E[SJ] 


=  It 


1  Z,  k 
k=0 


^  E [ E[sb 


(2.1.U) 


^  0ak(l-a)j-k  E[Sk]  , 
1_a  k=0 


where  E[S  ]  =  1  , 

•  • 

E[SJ]  =  lim  E[SJ]  for  j=  0,1,2,... 


n  — >  oo 


n 


We  observe  that 


lim  E[Sj] 
a  -  1 


i-l 

/  (jj)  (j-k) 

klo 


lim 


ll-a) 


j-k- 1-, 


>-a  -*  1  jaZ 


lim  E[S  ] 
L  a  — ►  1 


=  jr  lim  E[SJ_i] 
1  a  -  1 


=  Tt . 


and,  hence,  that  the  limiting  distribution  for  a  =  1  is  concentrated  on 


the  point,  jt^  . 


THE  DISTRIBUTION  OF  S  FOR  THE  CASE  a  <  I 


It  has  been  shown  that  S  is  independent  of  the  initial  response 
probability,  pQ(=So).  For  simplicity  it  will  be  assumed  that  pQ  =  0  and, 


hence,  that 


n 


5  =  )  (n  =  1,2,...) 


(2.1.5) 


j=l 


S  = 
o 


0 
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Suppose  that  the  values  of  e  »  €  ,  e  are  known  and  that 

id  n 

the  corresponding  value  of  is  denoted  by  x.  Assume  a  =/  Then,  if 


1 


S  >  x  +  (l-a)a11 


while,  if  €n+^  *  0,  S  cannot  exceed  the  result  obtained  when 
(r  =  n+2,  n+3,  ...).  That  is, 

00 


S  <  X  + 


k=n+2 


€ 

r 


1 


n+1 


=  x  +  (l-a)  [-p~  -  ] 

1  -a  1  -a 


=  x  +  a 


n+1 


Now,  x  +  a,n+^  <  x  +  (l-a)a11  when  a  satisfies  the  conditions 


0  CL  K  ^  . 

Therefore,  if  =  x,  S  is  outside  an  open  interval, 

(2.1.6)  (x  +  a,n+  ^ ,  x  +  (l-a )a,n )  , 

of  length  an(l-2a).  It  will  now  be  shown  that  the  intervals  obtained  in 
this  way  by  considering  all  possible  values  for  S^,  for  all  possible  n, 
are  disjoint  and  of  total  length  1. 


Let  S  .  (i=l,2, . . . ,2n),  with  S  .  <  S  ,  denote  the  2n  poss- 
ni  '  '  ni  n,i+l 

ible  values  of  S^,  each  of  which  may  be  written  as 


c j ( l-a)a^  1 

j=l 


> 


» 
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where  c  is  0  or  1.  Suppose  S  .  and  S  with  S  .  >  S  .  ,  are 
J  ni  nj’  ni  nj  * 

given  by 


n 


Sni  =  X  aj(1'a)aJ*1 


and 


j=l 


n 


!nj  =  Y  bjd-ajJ"1  . 


j=l 

Then,  a.  =  b.  for  j  =  l,2,...,m-l  while  a  ,=  1  and  b  ,  =  0  for 
J  J  mm 

some  m'  such  that  1  <  in  <  n  .  Consequently, 


n 


n 


Sni  -  Snj  “  I  aj ( 1~a)a^"1  -  £ 

•  /  1 

=m  j  =m+ 1 


*  , 

m-i.  »  in.  ii-uiv 
>  a  (1-a)  -  a  (1-a  ) 


r  j 

m, ,  n-m, 


n  m- 1 y  . 
-  a  +  a  ( l-2a) 


From  this  result,  it  is  clear  that 


(2.1.7) 


Sni  "  SnJ  -  a  (I-®)  (1,  j-1,2 . 2  J  1  /  j)  , 


ni  nj 


and,  since  a*1  ^(1-a)  >  an(l-2a),  the  intervals  obtained  for  a  given  n 
are  disjoint. 


Assume  that  some  pair  of  intervals  are  not  disjoint.  Then, 

there  exist  m,  n,  i  and  k,  with  m>n,  1  <  i  <  2°  and  1  <  k  <_  2™, 

for  which  one  of  the  following  is  true: 

/ 1  \  c  n+ 1  _  m+ 1  n .  » 

(1)  S.+a  <  S  ,  +  a  <  S  .  +  a  (1-a) 
v  '  ni  -  mk  —  ni  '  ' 


(2)  S  .  +  a11+^  <  S  .  +  am(l-a)  <  S  .  +  an(  1-a) 
v  '  ni  —  mk  v  '  —  ni  ' 
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If  (l)  is  true. 


(2.1.8) 


o  ,  n+1  m+1  n.  v  m+1 

+  a  ”  a  —  S  s„,*  +  a  vl~aJ  -  a 


mk  —  ni 


But  S  .  can  be  written  as 
mk 


m 


S  .  =  S  + 
mk  nt 


y  c j ( l~a)a^  1  , 


j=n+l 


.n 


where  c.  is  0  or  1  and  1  <  t  <  2  .  If  1  <  t  <  i,  we  have,  using 
(2.1.7), 


(2.1.9) 


q  .  q  n"l/'l  \  0  01 

Smk  ^  Sni  '  a  (1'a)  +  a  '  a  ' 


Inequalities  (2.1.8)  and  (2.1.9)  imply  that 


or 


n+1  m+1  ^  .  n  n-1  m 

a  -a  <  2a  -  a  -a 


, ,  N  n-1  m  _ 
( l-a)a  +  a  <0 


which  is  false  for  0  <  a  <  1.  If  t  =  i,  (2.1.8)  becomes 


m 


(2.1.10) 


n+1 


a 


m+1  . 
-a  < 


j=n+l 


c ^ ( l-a)a 


j-1  ^  n 


<  an(l-a)  - 


m+1 


a 


But,  if  c  .  =  1 , 
n+1 


m 


y  c^(l-a)a^  ^  >  an(l-a)  >  a,n(l-a)  -  a 
j=n+l 


m+1 


while  if  c  .  =  0, 
n+1 


m 


m 


V  /,  \  j-1  V1  /-i  \  j-1  n+1  m  n+1  m+1 

)  Cj(l-a)aJ  <  )  (l-a)aJ  =  a  -  a  <  a  -a 


j=n+l 


j=n+2 


n 


for  0  <  a  <  1.  Therefore  i  <  t  <  2  .  But,  in  this  case 


. 


. 


■  V< 
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Smk  -  Sni  +  ^  >  Sni  +  “  a™*1 


which  implies  a  contradiction  to  (l).  Thus,  there  exist  no  m,  n,  i,  and 
k  for  which  (l)  is  true.  A  contradiction  to  assumption  (2)  is  implied  by 
similar  arguments.  Hence,  the  intervals  of  the  form  given  in  (2.1.6)  ob¬ 
tained  by  considering  all  possible  values  of  S^,  for  all  possible  n,  are 
disjoint . 


For  a  given  n,  there  are  2n  intervals  of  total  length  (2a)n(l-2a). 
Therefore,  the  total  length  of  intervals  obtained  for  n  =  0,1,2,...  is  1. 

Since  the  possible  values  of  S  which  may  be  written  as 


(2.1.11) 


7  Cj(l-a)a^  1 

j=l 


where  c^  is  0  or  1,  are  outside  these  intervals,  the  limiting  distri¬ 
bution  is  concentrated  on  a  set  of  Lebesgue  measure  zero.  In  particular, 
when  a  =  l/3>  (2.1.11)  becomes 

(2.1.12) 


00 


(2Cj/3J) 


j=l 


and  the  limiting  distribution  is  concentrated  on  the  set  of  points  whose 
ternary  representation  excludes  the  digit  1,  that  is,  the  Cantor  set. 


Suppose 


■  I  cj(1'a)aJ'1 


J-l 


is  a  possible  value  for  S  .  Then,  if 

n 


00 


j=n+l 


S  =  x  + 


y 
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S  >  x  and 

00 

S  <  x  +  y  (l-a)cJ-'*'  =  x  +  a11 
j=n+l 

That  is,  S  is  in  the  closed  interval 

(2.1.13)  [x,  x  +  an] 

Any  other  value  of  differs  from  x  by  at  least  a11  ^(l-a)  which 

exceeds  a*,  the  length  of  the  interval  given  by  (2.I.I3).  Thus,  the 

intervals  obtained  for  all  possible  values  of  are  disjoint  and  S  is 

in  the  interval  given  by  (2. 1. 13)  if  and  only  if  e.  =  c.  for  j  =  1,2,...  n. 

J  J 

Therefore,  if  F  is  the  distribution  function  of  S, 

(2.1.14)  F(x+an)  -  F(x )  =  (n  =  1,2,...)  . 

where  r  is  the  number  of  c  .  which  have  value  1.  That  the  function,  so 
determined  is  not  absolutely  continuous  is  shown  below. 

Recalling  the  definition  of  an  absolutely  continuous  function 

we  have: 


A  function  f(x)  is  said  to  be  absolutely 

continuous  in  an  interval  (a,b)  if,  given 

n 


we  can  find  a  6  such  that 


v?i 


f(x  +h  ) 
V  V 

-f(x^)|  <  e  for  every  set  of  non- over lapping 

intervals  (x  , x  +h  )  such  that  Z-  h  <  6 
x  v  v  v'  v=J  v  — 

(See,  for  example,  Titchmarsh  [11]  page  364) 


For  arbitrary  n,  the  intervals  [S^,  S  + 


aU]  (i  =  1,2, 


form  a  set  of  non-overlapping  intervals  for  which,  using  (2,1,14), 

~n 


n 


(2.1.15) 


)  |F(S„i+an)  -  F(Sni)  |  =)  (")«i«2'r 

i=l  r=0 


1 


whereas 


(2.1.16)  ^  a°  =  (2a)n 

i=l 

Since  this  latter  sum  tends  to  0  as  n  tends  to  infinity  it  can  be  made 
less  than  any  prescribed  5  by  making  n  sufficiently  large.  However, 
there  exists  no  6  for  which 

2n 

I  |F(Sni+an)  -P(Snl)|  <« 

i=l 

when  e  <  1.  Therefore  F  is  not  absolutely  continuous. 


THE  CASE  a  =  i 
When  a  =  ■§•,  (2.1.2)  becomes 

a> 

S  =  £  (e./2j)  . 

j=l 


The  set  of  possible  values  for  S  are  all  numbers  of  the  form 


(2.1.17) 


9 


where  c^  is  0  or  1.  But  every  real  number  in  the  interval  [0,1]  can 
be  written  in  the  form  of  (2.I.I7)  or,  in  other  words,  in  the  diadic  form 


Let  [a,b]  be  an  interval  with 


, 


■  yy 


oo 

r 


Since  /  (aj/2J)< 

j=n+2 


,  this  interval  contains  all  values  of  S 


which  are  of  diadic  form 


Therefore , 


G-C—...C  0  a  T  *  *  * 

12  n  n+2  n+3 


F(b)  -  F(a)  =  P[a  <  S  <  b] 


P'€n+1  =  0]  iVt£rcj] 

Jsl  j 


r  n-r+1 
*  1  *2 


where  r  is  the  number  of  c.  which  have  value  1 

J 

when  , 


In  particular, 


F(b)  -  F(a)  =  V2n+1 


while 


b  -  a 


=  1/ n+1 


This  determines  F  as  the  uniform  distribution  in  this  case.  When  jt,  4  jt^  , 

1  <  2 

the  resulting  distribution  is  not  absolutely  continuous  as  shown  in  the 
following  discussion. 


variable 


The  limiting  distribution  is  the  distribution  of  the  random 

'  * 


00 


S  =  ^  (e/2j) 
j=l 


where  is  1  or  0  with  probability  and  >  respectively, 
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For  any  x  in  [0,1],  let  an(x)  the  number  of  one's  in  the  first  n 

digits  of  the  diadic  representation  of  x.  Given  =  p,  let  M(p)  be 

the  set  of  S  such  that  lim  a  (S)/n  =  p,  that  is,  S  e  M(p)  if  and 

n  — >  oo 

only  if  this  limit  holds.  Since  P[€^=l]  =  p,  we  have,  by  the  law  of 
large  numbers,  that  P[SCM(p)]  =  1.  On  the  other  hand,  each  distinct 
value  of  in  [0,1]  determines  a  distinct  set,  of  values  of 

S  so  that  if  p^jp^j...  are  different  values  of 

Ml)il(pm)]  =  Y  L[M(p  )]  , 

la  m  z_ j  m 

m 

where  L  denotes  the  Lebesgue  measure.  In  particular,  since  jt  =  ■§• 
determines  the  uniform  distribution  over  [0,1]  ,  L[M(-|)  ]  =  1  and  since 

U  M(Pm)C[0,l],  it  follows  that  L[M(pm)]  =0,  Pm  /  4*  This  implies 
that  the  limiting  distribution  function  for  /  j?  is  constant  almost 
everywhere  and  hence,  it  is  not  absolutely  continuous. 


THE  DISTRIBUTION  OF  S  FOR  a  >  \  =  \ 

Let  us  define  a  sequence  of  a's  as  =  1/^2  ,  n=l,2,..., 
and  consider  the  corresponding  learning  processes  for  =  2 

[Wintner  [12]  has  shown  that  for  each  member  of  the  sequence  just 
defined,  the  limiting  distribution  of  response  probabilities  is  absolutely 
continuous].  When  a  =  1/^2  ,  (2.1.2)  becomes 


S 


y  8j(i  - 

j=i 


(>■1/®)  it  (w/T* 

k=o 


(i/5T2) 


k=o 


+ 


d/Tzf'1  y 

k-o 
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After  the  e's  are  suitably  relabelled,  this  may  be  written  as 


(2.1.18) 


n 

'T1 

> 

i=l 


a.X. 

1  i 


where 


0  or 


1  with  probability  \  , 


and  X.,,  i=l,2,  n  are  independent  random  variables  each  of  which  has 

the  uniform  distribution  on  the  interval  [0,  1]. 


The  distribution,  say,  F  of  the  random  variable  S  given  in 

an 

result  (2.1.18)  is  the  convolution  of  the  distributions  of  a^X^  (i=l,2,  ...,n). 

Thus,  the  characteristic  function  of  F  is  easily  obtained  and  is  the 

a 

n 

product  of  the  characteristic  functions  of  the  distributions  of  the  summands. 

Kemeny  and  Snell  state  that  F  may  be  found  by  the  inversion  of  its 

an 

characteristic  function  and  that  it  is  piecewise  a  polynomial  function. 

In  chapter  III  another  method  of  determining  F  for  given  n 

an 

is  devised  and  several  solutions  exhibited. 


By  result 


THE  CHARACTERISTIC  FUNCTION  OF  S  . 


(2.1.2)  the  limiting  variate  may  be  expressed  as 


S  =  j  €j(l-a)a^ 

j=i 


where  (e^}  are  independent  random  variables  such  that 
with  probability  and  jr^  ,  respectively.  That  is, 


j 

S 


is  1  or  0 
is  the  infinite 
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sum  of  independent  random  variables  which  are  (l-a)oc^  1  and  0  with 
probabilities  and  >  respectively  (j=l,2,  ...  ).  Thus,,  the 

characteristic  function  of  the  distribution  of  S  is  the  infinite  product 
of  the  characteristic  functions  of  the  distributions  of  the  summands. 


Hence , 


j-1 


,/  its.  fr  r  it  ( l-<x)a  t 

■(e  )  =  1 1  [*2  +  itxe  ] 

j=l 


When  =  jc2 


2 


_ /  its. 
E(e  ) 


«  - 

oo  .  it(l-a',aT* 

^ - ] 

j=l 


-  n 


00 


j-1 


it(l-a)aj  /2  cos[t(l-a)a;,'1/2j 


=  exp(it//2)  ||  cos[t(l-a)a^  ^/2]  . 

J=1 


SECTION  TWO 


As  before,  let  p^  denote  the  probability  of  response 


on  trial  n  ,  pni  (1,2,  ...,  2n)  denote  the  2n  possible  values  of 

p  ,  and  P  .  =  P[p  =  p  .]  (1,2,  ...,  2n).  Let  ra  (t)  =  E(exp(tp  ))  be 
n  m  n  ni  n  n 

the  moment  generating  function  of  the  distribution  of  Pn  •  Then, 


ra  (t) 
n '  7 


2 

V" 

> 

L 

i=l 


tp  . 

Kni  „ 
e  P 


ni 


Since  pn+1  is  QlPn.  -  +  a^.  or  Q2Pni  =  a2  +  <x2Pni 

probability  jt.P  .  and  n0P  .  ,  respectively, 
r  J  1  ni  2  ni 


with 


.n 


ra 


r-  r  t(a1+a1Pni) 

t)  =  >  I e 


'■»!<*>  -l 

i=l 


t(a2+a2pn.) 

*lPni  +  6  rt2Pni 


■ 


' 
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pn  n 

ta  —i  ta  P  ta  r~>  ta„p  . 

*,e  1  )  e  1  nl  P  .  +  *0e  2  )  e  2  nlP  . 

1  l__,  m  2  ni 

i=l  i=l 


taj  ta 

=  j^e  mn(t  ttl)  +  *2e  ™n(t  ap)  . 


By  letting  n  -►oo  ,  a  functional  equation  for  the  moment  generating  function 
of  the  asymptotic  distribution  is  obtained.  It  is 


(2.2.1) 


m 


tai  ta2 

(t)  =  Jt1e  m(t  ax)  +  ^ m(t  a2)  , 


where  m(t)  =  lim  mn(t)  .  When  =  a0  =  a  ,  (2.2.1)  becomes 


n  — *» 


(2.2.2) 


ta  ta 

(t)  =  m(at)  [jt1e  +  *  e  3  • 


This  equation  implies  that 


toa. 


taa. 


m 


(at)  =  m(a2t)  [j^e  1  +  2] 


giving 


m 


p  ta.  ta  to,a_  tcoSp 

(t)  =  m(a  t)[jt1e  +  jt2e  +  j^e  ]  . 


Provided  that  the  resulting  infinite  product  converges,  this  procedure 
may  be  applied  iteratively  to  yield  the  explicit  solution 


(2.2.3) 


m 


n  .  n 

oo  ta  a  ta  a 

(t)  =  n  +  rt26  ^  • 


n=0 


That  this  product  does  converge  is  shown  in  the  following. 


Assume  that  0  <  a  <  1  ,  a^  >  a^  and  t  is  real.  Then, 
using  natural  logarithims, 


I  log  m(t) |  =  | 


00 

/  log  (rtj^e 

n=0 


n  n 

ta^a  ta^a 

+  *2e  ) 


::  ■  ■ 


When  -oo<t<0,t  =  -|t|  ,  and 


00 


I  .  |  n  ,  ,  n 

-  1 1 1  a,cx  -  1 1 1  a  a 


\  I  I  1  ^ 

I  log  m(t)  |  <  /  I  log  (j^e  +  n 


)l 


n=0 


00 


"  L  |  log  Uie 

n=0 


n 


t  la  a 


+  Jt2e 


.  i  n 

t  |aia 


)l 


00  i.i/  v  n 

V  ,  r  |t|(al+a2)a  . 

+  f  log  [e 


n=0 


00  00 

<  ^  |t|aian  +2  |t|(a1+a2)a 

n=0  n=0 


n 


\t\al  ltKa1+a2) 


1- 


a 


1  - 


a 


When  0<t<oo  ,t=|t|,  and 


00 

V1 


I  log  m(t)|  <  '  |  log  (jt1e  1  +  Jf 


t|aian  | t | a 0an 


/ 

n=0 


)| 


00 


<  j  jtja^a11 


n=0 


In  either  case,  | log  m(t) |  is  finite  and,  hence,  m(t)  converges 
for  -oo  <  t  <  oo  . 

If  the  random  variable  of  the  limiting  distribution  is  denoted 
by  p  ,  we  know  from  the  trapping  theorem  [see  Bush  and  Mosteller  [1], 
page  99]  that  P[p  C  I]  =  0  if  I  is  any  interval  which  is  disjoint 


. 


' 


al  a? 

from  the  closed  interval  joining  A  (  =  — -)  and  A_  =  (-r— ).  That  is, 

1  1  ~a  1  “a 

the  limiting  response  probabilities  are  distributed  between  the  limit 
points  A^  and  A  .  Let 

Ac  +  A, 

x  -  [r2— r3-]  -  [r-2® T-] 


■A2  -  Ax 


a2  -  \ 


so  that  x  is  distributed  on  the  interval  (-1,  l).  Let  m  (t)denote 

X 

the  moment  generating  function  of  the  distribution  of  x  .  Then, 


m  (t) 
xv  ' 


=  E 

exp 

or 

(2.2.4) 

mx(t)  =  exp( [ 

Thus , 

m 


E[exp( tx) ] 

A2  +  V 


([vt]t  -  [V\]) 

A„  +  \ 


2  1 


2  1 


(t)  =  exp[(A2  +  Ax)  |j  mx  [-|(A2  -  Ax)] 


This  result  may  be  substituted  into  equation  (2.2.3)  to  obtain  the  explicit 
solution  for  m  (t).  Thus, 


x 


t(l-a)a11A1  t(l-a)anA2 

[-|(A  -A  )]  =  exp[--(A  +A  )]  j|  [it.e  +  ] 

'  d  n=0 


m 


+  JTgS 


or 


m 


x 


A  +A 

(t)  =  exp[(/—  )t] 
2  1 


X 


n=0 


oo  A  +A 

||  exp[-t(l-a)an(A1+A2)/p\2-A1)]  =  exp[-(^  , 

n=0  2  1 


Since 


A  1  -  It  *r  )  1  ■  )  ’ 


-  h2  - 


(2.2.5) 


m 


“  .  / ,  \  n  ^  ,  N  n 

(t)  =  H  i*i*  {l~a)a  +  *pe  (  ]  . 

n=0 


The  characteristic  function  of  the  distribution  of  x  is 
$  (t)  defined  by 

X 

®x(0  =  ®x(it)  » 


where  i  =  -1  .  Thus,  from  equation  (2.2.5)  > 


(2.2.6) 


*  (t) 
x  7 


00 

n 

n=0 


SPECIAL  CASES 

Suppose  jt^  =  jtg  =  i  ♦  Then  (2.2.6)  becomes 

00 

(2.2.7)  ®x(t)  =  fj  cos[t(l-a)an]  . 

n=0 

Examples  of  the  types  of  limiting  distribution  to  which  this  characteristic 
function  gives  rise  are  given  below  for  several  values  of  a  ranging  from 
0  to  1. 

(i)  a  =  0  .  When  a  =  0  ,  equation  (2.2.7)  becomes  0  (t)  =  cos  t. 
The  binomial  distribution  of  the  random  variable  x  which  assumes  values 
+  1  with  equal  probability  has  characteristic  function 

1  it  1  -it 

fe  +  -|e  =  cos  t  . 

Thus,  the  limiting  distribution  for  a  -  0  is  tlis binomial  distribution. 

(ii)  a  *  -4  .  In  this  case,  equation  (2.2.7)  becomes 

00  00 

®  (t)  =  COS  (3t/2  n+  ) 

n=0 


(2.2.8) 


' 

. 


00  Qr,  ,  O  00  _  co 

=  H  cos(3t/2  )  IT  cos(3t/2  n+  i)/ IT  cos ( 3t/2^n+^ 
o=0  o=0  n=0 

00  00  1 

=  [J  cos(3t/2n)  ff  sec(3t/22n+l)  . 
n=l  o=0 


By  a  well  koownformula  [see,  for  example,  Kac  [ 13 ] >  page  1,  2], 

00 

n  c°s(3t/2n)  =  . 

n=i 


•x(t)  -  ^  ft  sec(3t/22n+1)  . 

0  n=0 

Now,  if  we  let 


- 

sin  3t 

3t 

k-l  „  , 

V*)  ■ 

^  n 

9  n=0 

(k  >  1)  , 

(2.2.9) 

Thus , 

(2.2.10) 


we  observe  that 

<J>  (t) 
xN  ' 


lim  ¥  (t)  , 

k->  00 


^(t) 


sio  3t 

3t 


sec 


21 

2 


_  sln..0.t/2). 

(3t/2) 

=  ^o^A)  c°s(3tA)  » 


yo 


sin(3L/2l 

(3t/2) 


sec(3t/2^) 


cos 


(Jt/O 


and 


s  in(  5t/2^) 

(3L/25) 

=  'i'1(t/10  cos  (3t/b) 


sin(3t/2^) 

(3t/23) 

cos  (3t/U) 

.  sec  (3t/25) 

sin(3t/25) 

(3t/25) 

cos  (3t/l|  ) 

.  cos  (3t/2k) 

=  f2(t/k)  cos  (3t/h)  . 


This  suggests  the  induction  hypothesis, 


(2.2.11) 


\(t)  =  \_i(t/i)  cos  (}t/k) 


k-1 

n 

n=l 


/  /^2n . 

cos  (3i/2  )  . 


Under  this  hypothesis, 


s  in  3t 

3t 


k 

n=0 


sec(3t/22n+1) 


=  \(t)  sec(3t/22k+1) 


sin  (3t/22k~1) 

(3t/22k_1) 


k-1 

n=l 


cos  (3t/22n)  .  sec(3t/22k+ 1 


sin  (3t/22k+1) 

(3c/22k+1) 


cos  (3L/22n) 


'i'k ( c/ll )  cos  (3t/t)  . 


This  is  the  same  result  as  that  given  by  equation  (2.2.11)  when  lc  is 
replaced  by  (k+l)  and  hence,  by  induction,  the  assertion, 

(2.2.12)  ¥nU)  =  Vi^A)  cosljt/U)  (..=1,2,3,  ...  ), 

is  true. 


Let  g^x)  ^>e  the  probability  density  function  corresponding 
to  the  characteristic  function  T^(t)  ,  so  that 


00 

SkU)  =  ^  i 


fk(t)e'itXdt 


0° 

=  I  ^k-l^/^  cos(3t/4)e'lt5<dt. 

uLoo 


Then,  since  ^  (t/K)  is  an  even  function  of  t  ,  we  have  by  a  result 

given  in  [14]  (page  rj), 


(2.2.13) 


8k(x)  =  2 [gk_1 (4x-3 )  +  gk_1(4x+3)]  (k  >  1)  . 


If  gQ(x)  is  known,  equations  (2.2.13)  determine  a  sequence  of  distributions 
which  tend  to  the  limiting  distribution  corresponding  to  00  =  ~  #  But 


g0W 


1 

n 


00 


*  'o 


sin  3t  cos  tx 

3t 


dt 


and  by  a  result  given  in  [15]  (page  60), 


grt(x)  =  Z  (-3  <  x  <  3) 


=  0 


(x  <  -3,  x  >  3). 


Thus, 


8l(x)  =  \ 


(-|  <  x  C  |) 


(x  <  *  >  f); 


0 


. 


and  so  forth. 


Li< 


o  - 


So(x)  =  I 


(-|  <  X  <  -|r,  |-  <  X  <  |) 


=  0  (x  <  <  x  <  B'  x  > 


§3(X) 


4 

3 


27  21 


J5 


(“32  <  x  <  "32  '  "32  <  x  <  32' 

«<><«.  S”<  s> 


21 


=  0  (x  <  -^2.  <  x  <  - 

u  ^  ^  52  ;  32  ^  ^  32  ' 


_I5  <  x  <  15  21<x<21  x>£>) 

32  32  '  32  32  *  32'’ 


n~  1 

Denote  the  2  intervals  of  nonzero  probability  density  which 

occur  at  the  n—  stage  by  I  (ro=l,2,  . ..,  2n  ^5  n  >  l).  The  2n 

Til  j  ID  " 

intervals  of  nonzero  probability  density  which  occur  at  stage  n+1  are 

constructed  from  the  intervals  of  the  preceeding  stage  in  the  following 

way*  divide  each  interval,  1^  into  eight  equal  subintervals,  say, 

i  1  ,  i  q  from  left  to  right  and  retain  as  the  two 

new  intervals  obtained  from  I  ,  the  union  of  i  _  and  i  _ 

n,nr  n,m,2  n,m,3 

and  the  union  of  i  ,  and  i  _.  The  value  of  the  probability 

n,m,o  n,ro,7 

density  function  over  each  of  these  new  intervals  is  double  that  of  the 

th 

probability  density  function  over  the  intervals  of  the  n  stage.  From 

this  construction,  it  follows  that  the  length  of  each  of  the  intervals  at 
th  o(n_]\ 

the  n  stage  is  3/2  '  '  and  the  value  of  the  probability  density 

n  **  1 

function  over  these  intervals  is  2  /3.  The  total  length  of  the  intervals 

at  stage  n  is  3  2  ”  /e  '  "  which  approachs  0  as  n  approach;, 
infinity.  Thus,  the  asymptotic  distribution  is  concentrated  on  a  set  of 
points  of  Lebesgue  measure  0.  This  set  is  constructed  in  much  the  same 


' 

. 


way  as  the  Cantor  Ternary  set.  This  suggests  that  the  limiting 
distribution  is  concentrated  on  a  nonenumerable  set  of  points  of  L<besgue 
measure  0  and  that  the  cumulative  distribution  function  is  not  absolutely 
continuous . 


(iii)  a  =  2.  When  a  =  £  ,  equation  (2.2.7)  becomes 


*,<*>  = 


00 


n=l 


cos 


(t/2n) 


=  sin  t/t  , 


Thus,  the  asymptotic  distribution  is  uniform  over  (-1,  l). 


(iv)  a  =  1//2  .  In  this  case, 


$  (t) 
xN  * 


00 

~r 

|  cos 
n=0 


00  00 

[  cos[(*/2»l)t  2~n]  j]  cos[(2-V2)t  2"n3 


n=l 


n=l 


=  { sin[  (N/2-l)t  ]/(s/2-l)t)  .  [sin[(2-*/2")t  ]/(2-*j2)t) 


Now,  sin[(\/r2~l)t3//(*/2-l)t  and  sin  [(2V2)t]/(2-s/2)t.  are  the  characteristic 
functions  of 


and 


f 


(K)  =  2p^TT 


0 


f  /  \  = 

(x) 


z]2{ J2-1) 


=  o 


(-  [n/2-1]  <  x  <  [n/2-1]) 

(x  <  [n/2-1]  ,  x  >  [n/F-1]) 


(-/2[n/2-1]  <  x  <n/2[n/2~1]) 


(x  <  -n/¥[*v/2“1]  ,  x  >  n/2[n/2-1]  ), 


respectively.  Thus,  $  (t)  is  the  characteristic  function  of  the 

X 


convolution  of  a  uniform  distribution  over 


( -  [n/2- 1  ] ,  [n/S-  1  ] )  with  a 


-  he  - 


uniforin  distribution  over  (-[2-n /IF]  ,  [2"*f2])t 


In  order  to  determine  the  limiting  distribution,  consider  the 
more  general  case,  where 


(2.2.15) 


(2.2.16) 


gaGO 


2a 


=  0 


8b(x) 


1_ 

2b 


=  0 


(-a  <  x  <  a) 

(x  <  -a,  x  >  a)  , 
(-b  <  x  <  b) 

(x  <  -b,  x>b)  , 


and  a  >  b  .  The  convolution  of  g  (x)  and  g,  (x)  is 

ci  D 


f(x)  =  /  8a(y)  gb(x-y)  dy  . 

.00 


From  (2.2.15)  and  (2.2.16),  the  product  ga(y)  gb(*-y)  is  — -  when 
y  satisfies  simultaneously  the  conditions,  -a  <  y  <  a  and  x-b  <  y  <  x+b  5 
otherwise  it  is  zero.  Therefore, 


f(x)  =  0 


(x  +  b  <  -  a)  , 


1 

lUb 


x+b 

dx 


-a 


x+a+b 

ij-ab 


(x-b  <  -a  <  x+b)  , 


1 

4ab 


x+b 


dx  = 


x-b 


2a 


(x-b  >  -a  and  x+b  <  a)  , 


1 

s^b 


dx  = 


x-b 


a+b-x 

Vab 


(x-b  <  a  <  x+b)  , 


0 


(x-b  >  a)  . 


■ 

, 


t 


1+9  - 


By  applying  these  results  to  the  case  a  =  and  b  =  sl~2 l-l  (so 

that  a+b  -  1  and  a-b  =  the  probability  density  function,  f(x), 

of  the  limiting  distribution  is  obtained.  It  is  given  by, 


f  (x)  =  l+x/tWi^-l)2] 

=  l/[2s/2(\/2-l )  ] 

=  1-x/[iJ2(n/2-1)2] 

=  0 

(v)  a  =  l  .  Here,  <&  (t)  =  1 

A 

is  given  by 


(-1  <  X  <  -[3-3/ 2])  , 

(-[3-»/2]  <  x  <  [3-3/2])  , 
([3-3/2]  <  x  <  1)  , 

(x  <  -1,  X  >  1)  . 

and  so  the  asymptotic  distribution 


P [x=0]  =  1 

P [x=*w ]  =0  w  ^  0 


-  50  - 


CHAPTER  HI 


ADDITIONAL  RESULTS  FOR  CERTAIN  LIMITING  DISTRIBUTIONS 


Let  g(x  5  a)  denote  the  distribution  function  which  is  0,  1  or 

# 

1  according  as  x  <  -a,  -a  £  x  <*  a  or  x  >  a  .  Then  g(x  ;  a)  is  called 
a  symmetric  Bernoulli  distribution  function  and  has  as  its  characteristic 
function,  cos(at)  .  The  distribution  function,  say,  cr(x)  ,  of  the 
convolution  of  infinitely  many  such  distributions  is  the  distribution 
function  of  the  quantity  x^  +  X2  +  •••  >  where  the  for  i  =  1,  2,  ... 

are  independent  random  variables  such  that  x.  is  -a.  or  a.  with 

ill 

probability  \ .  The  characteristic  function,  say,  L(t  ;  o),  of  cr(x)  is, 
therefore, 

00 

L( t  5  or)  =  {]  cos(ant) 

n=l 

provided  that  this  infinite  product  converges.  The  characteristic  function 

/ 

given  by  equation  (2.2.7)  is  a  particular  case  of  this  function.  Thus  many 
properties  of  the  desired  limiting  distributions  are  given  in  studies  relating 
to  L(t  ;  cr).  Some  of  these  properties  are  stated  below. 


Jessen  and  Wintner  [l6]  have  shown  that  o(x)  is  either  absolutely 

continuous  or  purely  singular.  Wintner  [12]  proved  that  when  a^  =  2  n/^ 

(n  =  1,  2,  ...  )  h  =  1,  2,  ...  ),  cr(x)  is  absolutely  continuous  and  acquires 

derivatives  of  arbitrarily  high  order  for  every  x  as  h  — ►  o°  ,  there  being 

fch 

assured,  for  fixed  h  ,  the  existence  of  a  continuous  (h-l)  derivative 
for  every  x  .  Kershner  and  Wintner  [17]  have  shown  that  for  the  case 
a  =  an(0  <  a  <  l),  cr(x)  is  singular  and  concentrated  on  a  set  of  measure 


I 

fi  X  2  *“ 
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zero  when  a  <  2  ,  that  <r(x)  is  concentrated  on  the  Cantor  set  when  a  =  , 

.7 

and  that  cr(x)  =  if  -1  <  x  <  1  and  a  =  \  .  On  the  basis  of  the  work 

done  by  these  authors,  Erdos  [l8]  proved  that  for  the  case  a  =  an  ,  there 

n 

exist  values  of  a  ,  with  2  <  a  <  1  ,  for  which  cr(x)  is  purely  singular. 
Examples  are  a  =  \  -  l)  and  the  positive  root  of  the  cubic  equation 

+  a^  -  1  =  0  (This  root  is  between  1/12  and  1/^2  ). 


From  the  summary  of  known  results  which  has  been  given  above  and 
in  Chapter  II,  it  is  clear  that  the  general  character  of  the  asymptotic 
distribution  is  known  for  0  <  a  <  \  in  processes  involving  two  experimenter- 
controlled  events  and  the  equal  alpha  condition.  With  the  restriction 
7t^  =  =  2;  imposed,  an  explicit  solution  has  been  determined  for  a  =  1//2 

and  a  method  suggested  for  determining  the  asymptotic  distributions  corresponding 
to  the  members  of  the  sequence  =  1 /  4 2  ,  n  =  1,  2,  . . .  .  Although  it  is 
known  that  purely  singular  solutions  occur  for  =  2  and  certain  values 

of  a  in  the  interval  (2 ,  l)  ,  the  set  of  values  in  this  interval  for  which 
the  solution  is  singular  is  not  known. 


The  present  chapter  of  this  thesis  presents  some  new  results  and 
includes  an  extension  of  some  of  these  results  to  the  more  general  process 
discussed  in  Section  Three  of  Chapter  I.  In  Section  One,  a  set  of  functional 
equations  for  the  characteristic  function  of  the  limiting  distribution  are 
derived  from  equation  (2.2.1).  These,  in  turn,  yield  a  set  of  functional 
equations  for  the  limiting  cumulative  probability  distribution.  One  member 
of  this  latter  set  proves  useful  in  Section  Two  and  is  suitable  for  use  in 
the  numerical  approximation  of  the  limiting  distribution.  A  system  of  n 
functional  equations  for  the  limiting  cumulative  probabilit)  distribution 


' 
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function  yields  an  explicit  solution  for  the  raw  moments  of  the 
limiting  distribution.  In  Section  Two  a  simple  procedure  is  developed  for 
computing  the  limiting  probability  density  function  corresponding  to  given 
members  of  the  sequence,  a,  =  1/^2  (n  =  1,2,  ...  ),  The  character  of 
the  several  solutions  exhibited  in  Section  Two  suggests  that  the  sequence 
of  limiting  distributions  corresponding  to  the  members  of  the  sequence 
=  I/n/2  (n  =  1,  2,  ...  )  tends  in  the  limit  to  the  normal  dis¬ 
tribution.  This  is  proven  (without  recourse  to  the  Central  Limit  Theorem) 
in  Section  Three.  Finally,  in  Section  Four,  several  basic  results  are 
extended  to  the  case  of  learning  processes  involving  t  experimenter- 
controlled  results. 


SECTION  ONE 


If  m(t)  denotes  the  moment  generating  function  of  the  limiting 
distribution  of  response  probabilities  in  a  learning  process  involving  two 
experimenter-controlled  events  and  the  equal  alpha  condition  then  [by 
equation  (2.2.2)]  m(t)  satisfies  the  functional  equation 

t(l-a)V  t(l-a)* 

(3.1.1)  m(t)  =  m(at)[jt1e  +  ] 


Denote  by  m  (t)  and  0  (t)  ,  respectively,  the  moment  generating  and 

X  X 

characteristic  functions  of  the  distribution,  over  (-1,  l),  of  the 
random  variable  x  given  by 


+  ^1 

2p 

.>2  ‘  Al  - 

-X2  "  A1  J 

x 


(\p  /  )  ’ 


- 
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where  p  is  the  random  variable  of  the  limiting  distribution  of  response 
probabilities.  Then  $  (t)  =  m  (it)  and,  using  result  (2.2.4), 

X  X 


<&  (t) 
x v  ' 


=  exp 


(3.1.2) 


=  exp 


it( 


.t( 


'z  +  H,1 

X2  -  A^ 


+  A, 

- - -) 

a2  -  a/ 


mi 


-2  it 

A2  "  V 


<t>( 


-2t 


\  "  A1 


)  , 


where  <J>(t)  =  m(it)  and,  by  equation  (3.1.1),  $(t)  satisfies 

it(l-a)^  it(l-a)Ap 

(3.1.3)  #(t)  =  $(at)[jt1e  +  n2e  ]. 


Thus,  from  (3.1.2)  and  (3.1.3); 

0(0  =  |  (*2  "  Al^  6Xp^  1T^2  +  Al^ 


and 


®*f-  I(a2  •  V]  =  exp[-  lt21~a,4>>2  +  vi  *k[-  r  -  V1 


X  [« 


it(l-a)A^  it(l-a,)A2 

1  +  1t2e 


] 


or 


$  (t) 

x 


"f  ^  1 

d>x(at)  exp [ i t ( 1  -a ) (^-  -- y- ) ] 


X  [it  1e 


-2it  ( 1-a.  )A  l7f  2->  -2i  t  ( 1-a  ^ 


+  *2e 


giving 


/  \  A  /  sr  it ( 1-a.)  -it ( 1-a.) , 

*x(t)  =  <I>x(a't;)[jt1e  +  «2e  J  . 


(3.1» 
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Equation  (3. 1.4)  implies  that 


*x(e)  =  f[  ] 

n=l 


and 


*„(')  ■  V*3*)  II  U1elt(l'a)an+1«2e-it(l-a)aI’'1] 

n=l 


Continuing  in  this  way,  we  obtain 


(3.1.5) 


$ 


(t)  =  *x(art)  ft  [nieit(l-Hn;1n2e-it(1-a)an'1],(r>1). 

n=l 


In  the  expansion  of 


n  r  it(l-a)an"1  -it ( l-a)an” 1 

11  lJt1e  +  Jt2e  ]  , 

n=l 


there  are  2  terms  each  of  the  form 


it*"™  exp[-it(l-a)(C;L  +  c2a  +  ...  +  c^1"1)] 


r-1 


where  c ^ ( j  =  1,  2,  . . .  ,  r )  is  +1  and  m  is  the  number  of  the  which 

have  value  -1.  Let  X^(i  =  1>  2,  ...  ,  2r )  denote  the  possible  values  of 


(3.1.6) 


r-1 


(l-a)(Cl  +  c2<x  +  ...  +  cra  ) 


and  let  ro(X  . )  be  the  number  of  the  c  .  which  have  value  -1  when  X  . 

'  n'  j  n 


is  expressed  in  the  form  of  (3.1.6).  Then, 


m 


*Jt)  =  ®x(a  t) 


(X  .  )  r-m(X  .  )  -itX 


*  rj  jt 
*1  n2 


rj  e  *3 


(3.1.7) 


“  55  - 


By  the  inversion  theorem  (see,  for  example,  Kendall  [ 19 3  page  94), 
the  cumulative  probability  distribution  function,  say,  F(x)  ,  corresponding 
to  $v(t)  is  given  by 

-ixt 


x 


00 


(3.1.8) 


F(x)  =  F(o)  + 


1  -  e 


2*  loo  U 


^x(t)  dt  , 


where  F(x)  =  0  for  x  <  -1  and  F(x)  =  1  for  x  >  1  ,  since  x  is 
distributed  on  (-1,  l).  Then,  if  a  is  any  constant 


t  r  00  .  -ixt 

s  /  Ljtr  •  e'  a  •  * 


2jt 


00 


I  .  -it(X+a)/ar  ,«0  -ita/ar 

- - — -  .  $  (t)  dt  -  ^  /  ~ — -  •  $ 


-00 


2"  loo  ic 


so  that,  by  equation  (3.1.8), 


O  —00 


F(^)  -  F(-f)  (r  >  1) 

a  a 


From  equations  (3.1.7)  and  (3.1.8)  it  follows  that 


oo  „  -ixt 


f(*)  -  F(°)  ■  h  JL  1TT—  •  *x(t>  dt 


m(Xr.)  r -*»(XrJ)f1 
*1  *2  |2* 


oo  -ixt  -itX  . 
1-e  rj 

e 


j  =1 


-00 


it 


$  (a  t)  dt 
x'  ' 


(t)dt, 


and  hence,  by  equation  (3.1.9);  that 


1 
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(3.1.10)  F(x)  -  F(0)  =  \  «J 


m(X  . )  r~m(X  ) 
ri7  '  ri7 


i=l 


F( 


x+X  .  X  . 

rix  ri 


a 


)  -  F(-Si) 
a  J 


In  particular, 


^  m(X  .)  r-m(X  .) 
F(-l)  -  F(0)  *2  11 

i=l 


-1  +  Xri  Xri 

F( - r- i)  -  F(— —) 

a  a1  • 


But, 


X  .  <  (l-a)(l+a+  ...  +  ar_1)  =  l-ar 


ri 


and  hence 


-1  +  X  .  .  r 

F( - 

a  a 


=  F(-l)  =  0 


and  since  F(- 


-1  +  X 


ri 


)  >  0  ,  we  must  have  F(- 


-1  +  X 


ri 


a 


)  =  0  (i  -  1,  2,  ...,  2r) 


a 


Thus, 


F(0)  =  >  it. 
i=l 


m 


(X  , )  r-m(X  . )  X 


ri 


it. 


ri  F  (— ) 

\  r  / 

a 


and 


(3.1.11) 


F(x) 

i=l 


m(X  . )  r-m(X  . )  x  +  X  . 
'  n7  '  n7  _  /  ri 

it„  F  ( - 

2  r 

a 


)  (r  >  1)  . 


When  r  =  1  ,  equation  (3. l.ll)  becomes 


(3.1.12) 


\  r,/x  +  1  -  .  „/X  -  1  +  a,x 

F(x)  =  tc-F (— - - - )  +  F( - - - -)  . 


a 


a 


This  result  will  be  applied  in  Section  Two. 
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Equations  (3. 1.11 )  imply  that 


(3.1.13) 


d  F(x)  7t1 


i=l 


m(X  )  r-m(X  )  x  +  X 

ri  #  rl  d  F(— j-Ei)  . 

a 


Let  V  denote  the  raw  moment  of  the  distribution  of  tha  ^.r^cr:; 

s 

variable,  x  .  Then,  since  d  F(x)  =  0  when  x  <  -1  and  x  >  1  ,  we  find 
front  equation  (3.1.13)  that 


Y 

f  V 


Vg  -  /  *  dF<*>  -2.  *1 

1  i=l 


r  cl 

-  «n(X  )  r-n(X  )  f  1  x  +  X 

rl  rl  x&dF(— — -Xi) 


-ar~X  . 
n 


a 


in 


(X  . )  r-m(X  . )  n 1 


ri'  '  ri'  /  r  r  ..  ,s 


*2 


i=l 


-LI 


[a  X“xri)  dF(x) 


m( 


<Xri>  r-m(Xri),x  ...  xr,  .. 

It,  1  )  (l,  )  a  (  “X  .  ) 


1  “2  'k 

i=l  k=0 


ri 


where 


a»  ,  (-►) 


k=0 


(r  >  1,  s  >  1)  , 


(s-k) 


m(Xri)  r-ra(Xri) 

=  / 

i=l 


(-X  .) 
\  ri  / 


s-k 


Thus, 

(3.1. 1*0 


s-1 


Vs(l-arS)  O  akr  Vk  Sr(5-k)  (r  >  1,  s  >  l) 


k=0 


vo  = 


1 
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From  equations  (3.1.14)  we  obtain  a  system  of  s  equations  in 
the  unknown  quantities  V^,  V  ,  . . .  ,  Vg  by  choosing  the  equations 
corresponding  to  r  =  1,  2,  . . .  ,  s  .  That  is, 

-  f  <.!„>  v.  ■  hM 

k=l 

s-1 

(3.1.15)  Vs(l-a2s)  -J  (s!k)a.2(s‘k)Vs.k  S2(k)  =  S2(s) 

k=l 


k=l 


(s  >  1) 


When  =  jt 2  ~  *  ’ 


(s-k) 


-  (i)'  £  (-Xrl) 


s-k 


i=l 


is  zero  if  s-k  is  odd  since  the  terms  in  the  sum  will  then  cancel  in  pairs 
At  the  same  time  ^gS+l  ~  ®  (S  =  0,  1,  2,  . . .  )  since  the  characteristic 
function  of  F(x)  [see  equation  (2.2.7)]  Is  even.  Thus,  for  the  case 
=  2  we  obtain,  from  equations  (3. 1. 15)>  the  following  system  of 
equations  in  the  unknown  quantities  Vgg  ,  ^  >  • • •  >  • 


: 
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s-l 


V2s(1-2s)  -  £  (22!2k)  a2(S-k>V2s_2k  Sl(2k)  .  S 


k=l 


(3.1.16) 


v2s(!-a4s) 


S-l 


k=l 


(  26  )  a2<2s-2k>V  S  <2k> 

'2s-2k'  2s-2k  2 


(2s) 

2 


S-l 


2s 


/.  2s  v  \  /  2s  x 

1-<X  >  2s-2k  a 


2s  ^s(2s-2k)  o  (2k) 

2s-2k  s 


S  (s  >  1) 

s  —  J 


k=l 


Provided  that  /  ^2  ’  exP^^c:*-1:  solutions  are  obtained  for 

V  (s  =  1,  2,  ...  )  by  solving  equations  (3.I.I5).  The  determinant  of  the 

s 

coefficients  of  V^,  Vg  ^  ,  and  ,  say,  Dg  is  given  by 


where 


D 


s  >  ft  2_ 


X  (s!k) 


a.  . 


a . 


(l-a1S)  (i  =  1,  2,  ...  ,  s) 


il,rt. 


and 


a.  .  =  -a 


i(s- j+1 )  _  (j-1) 


S.XJ  /  ( i  ^  2,  .. .,s;  j -2  ,  3  ^ • « • t  s 


Thus, 


V 


X  Ck) 


b .  . 


/  D 


s,jt. 


b.  . 


a.  . 

ij,«- 


(3.1.17) 


: 

t  ■  .-*  e 


. 


6o  - 


where 


bM  =  S. 

il,  it^  1 


(s) 


(i  =  1,  2,  ...  ,  s) 


and 


b.  . 


=  a.  . 


lj,  it. 


(i  —  Ij  2,  ...,  s  j  j  —  2,  3>  •••;  s)< 


On  the  other  hand,  if  -n^  =  it^ 

equations  (3.1.16).  If  D  i 

<-  s ,  z 


=  I  ,  V  (s  =  1,  2,  ...)  is  found  from 
is  the  determinant  of  the  coefficient  of 


V  V  •••  -  v 


2s  ’ 


D 


Sff1  (  2s 

,11  V2s-2k 
k=l 


) 


a.  .  l 

1J,2 


where 


a  L  =  (l-a2is)  (i=l,  2, 
1.  x  ^  2 


s) 


and 


a.  i  =  -ai(2s"2j+2)  s<'2}'2')  (1=1,2,  ...,  s;  j=2,J, . . . ,  s), 

1J;2  1 


Thus,  for  it  ^  =  n2  =  2 

(3.1.18)  V 


2s 


s-1 


n  (2f!2k) 


■k=l 


2s-2k' 


b.  .  i 

1J,2 


/D: 


2s,  l 


b.  .  i 

ij>2 


a.  .  i 
lj>2 


where 


(2s) 


(i  -  1,  2,  ...,  s) 


' 
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and 


b 


(i  —  1,2, . . . ,  s  *  j  _  2, 3, 


gives 


Since 


We  observe  that  when  =  jtg  =  i  ,  the  first  of  equations  (j.l 


V0  =  (1-  a  ) 


2n-1  c  (2) 


X 


11 


(l-a)  and  X12  =  (l-a)  , 


Sx(2)  =  i[(l-a)2  +  (l-a)2] 


=  (l-a)‘ 


and 

(3.1.19) 


1  -  a 
1  +  a 


SECTION  TWO 


Consider  a  learning  process  with  two  experimenter  controlled 
events,  and  E^  for  which  =  l/v2  ,  where  n  is  some 

positive  integer,  If,  in  addition,  ix  =  jt _  =  j  ,  $  (t)  satisfies,  by 

1  C  X 

equation  (2,2.7); 

00 

(3.2,1)  <J>  (t)  =  J]  cos  [t(l-l/rV2’)(Ii/2)‘j]  , 

j=0 


where  $  (t)  is  the  characteristic  function  of  the  distribution  of  the 
x 

random  variable 


A2  +  V 

f-2£_l 

-A2  "  Al- 

-A2“A1- 

(A2  / \)  , 


.16) 
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and  p  is  the  random  variable  of  the  limiting  distribution  of  response 
probabilities  for  the  process  just  described. 

Since,  by  formula  (2.2.9)  > 


cos 


s  In  [2t  ( 1-  lA'2  )2~  |  k~ 1  |/n  ] 

[2t(l-l/?2)  2-(k-1,/“]  ' 


we  have,  from  equation  (3.2.1), 


ft  sin[2t(  1- 1/^/2  )2~(k~1^/n] 
k=l  [2t(l-l/1V2)2'^k“1^//n] 


(3.2.2)  ®x(t)  =  (Cn/tn)  fl  sin(bkt)  , 

k=l 


where 

d  =  1/|n/2  -  l)n  •  2<n-1V2] 

and 

bk  =  2C42  -  l)2-k/n  . 

We  observe  that  if  F^x)  tbe  uniform  distribution  over  (-b^,  b^)  , 

0  (t)  is  the  characteristic  function  of  the  convolution  of  F  (x),  Fp(x),  .  . 

and  F  (x)  . 
n 
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Keroeny  and  Snell  [9]  page  229  have  shown  that  in  the  case  being 
considered,  the  limiting  probability  density  function,  say,  f^(x)  ,  exists 
for  all  x  ,  It  is  given  by 


n 


00 


(3.2.3) 


£  (x)  =  5—  /  e  4  (t)  dt  . 

n  2*  x 


Now,  in  the  expansion  of 


(2i)  Q  [}  sin(ht) 


k=l 


=  [7  ^exp[2it(rV2-l)2"k//n]  -  exp[  -2it(  IV2-l)2’k/n  ] 

k=l  L 


ji 


there  are  2  terms,  each  of  which  may  be  written  as 


(~l)'!l  exp{it(  n/2-1)[c1+c2(1V2)  +  ...  +  cn(1V2)n"  ]}  , 

where  c^  ( i  =  1,  2,  . . . ,  n )  is  +  1  and  m  is  the  number  of  the  c^  which 
have  value  -1.  Let  X  .  (i  =  1,  2,  ...,  2n)  ,  with  X  .  <  X  .  n  ,  denote 
the  possible  values  of 


(3.2.4) 


(lV2-l)[c1+c2(Ii/2)  +  ...  +  cn(  ^/2)“-1) 


and  m(X  .)  be  the  number  of  the  c.  which  have  value  -1  when  X  .  is 


ni 


ni 


expressed  in  the  form  of  (3.2,4).  Then 


(3.2.5) 


m 


0  (t)  =  lcJ/(2it)n 

x  |  Cv 


m 


(-D 


(X  .)  itX 


nj 


J 


inl¬ 


and  using  (3.2.3); 
(3.2.6) 


.n 


C 


00 


n 


f  (x)  - 
n  „n+ 1 


2  Jt 


m(X  .  )  -it (x-X  .  ) 
(-1)  rJ  e _ _nJ 


j  ’1 


/  •  \n 
(it) 


-Ht. 


■ 

, 


. 
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It  will  now  be  shown  that 


(3.2.7) 


dr  (x) 
n 

dxr 


00 


n 


2n+1n 


2n  m(X  . )  -it(x-X 
-  d  '  \  1  (-!)  Dj  e 

-CO  dxr 

J  ^1 


(it)n 


dt 


for  r  >  1  , 


The  integrand  on  the  right  hand  side  of  equation  (3.2.7)  is 


.n 


dx 


(-D 


m 


(X  .  >  -it (x-X  . 1 


r  J 


r.J 


i  =1 


/  .  \n 
(it) 


J 


sn 


2“  r+m(X  .)  -it(x-X  .) 
/  ,  \  m  n  1 

(-0  e _ i. 


.  _  \n-r 


J  -1 


(it) 


(-l)W  -itx  n 


n-r 


e  FT  sin  (b.  t ) 

k=l  k 


which  tends  to  0  as  t  — ►  0  and  is  continuous  in  t  for  all  x  .  The 
corresponding  integral  (see,  for  example,,  Pierpont  [20]  page  453)  exists 
and  is  uniformly  convergent  for  all  x  (ibid, , page  467).  Assertion  (3.2. 
(ibid.,  page  493)  from  these  observations. 


When  r  =  n-1,  equation  (3.2.7)  becomes 


(3.2.8) 


dn"lf  (x) 
n  ' 


dx 


n-1 


(-i)"-V 

2n+1* 


(X  ,  )  -it(x-X  .  ) 
v  rj  nj 

”t-  '  ‘J 


00  sin  [  t(x-X  .  )  ] 
'  ni 


7)f ollows 


dt 


dt  . 


■ 
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But  (see,  for  example,  Kendall  [19]  page  95); 


(5.2.9) 


r\°°  sin[  t(x-X  .  )  ] 
ni 

t 

0 


dt  = 


Jt 

*2 


ZL 

2 


V 


x  <  X  . 
ni 


JO  x  =  X  . 

ni 


x  >  X 


ni 


dn'1f  (x) 

for  i  =  1,  2,  . . . ,  2n  .  Thus  — 


dx 


n-1 


is  constant  for  all  x  and  f  (x) 

n '  ' 


is  piecewise  a  polynomial  of  degree  not  exceeding  (n-l). 


Now  Xnl  =  -(  n/2-1)[1  +s[2  +  ...  +  (  $2)n~l]  =  -1  and 

Xn2  =  +T^2’  +  ...  +  (^r1]  =  -5  +  2  (V2)  .  When 

-1  <  x  <  -5  +  2(^2)  , 


Thus,  when  -1  <  x  <  -5  +  2(  ^)  the  coefficient  of 


xn  1  in  f  (x)  is 
n 


(3.2.10) 


(n-l).'  2 


— -  =  l/[(n-l):  (  ?/2-l)n  #n'^2]. 


It  was  shown  [see  equation  (5.1.12)]  that  the  cumulative  probability 


distribution  function,  F(x),  of 
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X  = 


A2  *  A1 

A2  "  A1 


2R. 


A2  ‘  A1 


(A2  / 


satisfies,  for  general  a  and  , 


F(x)  =  fgF(^— +  ^(^±2:)  (-»<  X  <oo  ) 


(3.2.11) 


=  0 


(x  <  -1) 


=  1 


(x  >  1)  . 


When  jtj  =  jt0  =  2  and  a  =  1/*V2  (n  >  l)  ,  equation  (3.2.11) 


becomes 


2F(x )  =  F[(ns/2)x  +H/2-1]  +  F[(IV2)x  -^2  +  1]  (- oo<  x  <oo)  , 

=0  (x  <  1)  , 

=  2  (x  >  1)  . 

Thus,  the  probability  density  function  f^(x)  satisfies 

(3.2.12)  2fn(x)  =rV2{fn[(IV2)x  +n^-l]  +  fn[($2)x  -V^+l]}  (  ~CO<  X  <  oo  , 

=0  (x  <  -1,  x  >  1 )  , 

or 

(3.2.13)  fn(x)  =  2(-n-1Vn  fjZrJhl)  _  fjx  +  2(1-  ^2)]  (-oo<x<co), 

v  2 

=0  (x  <  -1,  x  >  l)  . 


Now,  when 

-1  <  x  <  -3  +  2(  n/2)  ,  x  +  2(1  -  v2 )  <  -1 

f n [ x  +  2(1  -lV2)]  =  0  . 


and 


Thus,  for  -1  <  x  <  -3  +  2(^2), 


0 


X 


&.  o 

^2  i  il  4BdM<J 


<»  a,  -  .  +  -  . 


'  '  *  ■  >  >  i - 
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f  (x)  satisfies 
n 


(3.2.14) 


This  equation  has  as  its  solution 


(3.2.15) 


fn(x)  =  c  (1  +  x) 


where 


:n  =  i/(n-i): 


in  order  that  the  coefficient  of  x 


n-1 


n-1 

(tJ/2-l)n  2(>-1)/2 

be  the  same  as  that  given  in  equation 


(3.2.10). 


For  convenience  denote  by  Tn^(x)  and  T  „(x)  ,  respectively, 

X  -  V2  +  1  1 — 

the  transformations.  -  and  x  +  2(l  -tv2)  .  For  all  x  such  that 

2 

-1  <  T^(x)  <  -3  +  2(  n/2)  and  -1  <  T^^x)  <  -3  +  2(V2)  ,  that  is,  for 
-3  +  2(  $2)  <  x  <  -1  -2(  T2)+  2(1/2)2  , 

fn(x)  =  Cjtx+l)”"1  .  [x  +  3  -  2(s(2)]n'1)  . 

When  T^(x)  is  in  the  interval  (-1,  -3  +  2(1\/2))  and  T^(x)  is  in 

(-3  +  2(  91),  -1  -2(1/2)  +  2(  ’-yi)2)  , 

(3.2.16)  fn(x)  =  Cn{(x+l)n'1-[x+3-2('!/2)]n'1-[x+l+2(')/2)-2(n«/2)2]1''1) 

The  upper  boundary  for  the  set  of  values  of  x  for  which  (3.2.16)  holds  is 
the  minimum  of  and  x^  found  by  solving 

Tnl(x1)  =  -1  -2(1/2)  +  2(1/2  f  and 


Tn2vX2)  =  -3  +  2(1/2)  . 
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While  it  appears  that  this  procedure  could  be  continued  and  f  (x) 

n 

determined^  the  problem  of  determining  the  remaining  boundaries  sequentially; 

in  this  manner;  has  not  been  solved  for  general  n  .  Equation  (3.2.8) 

suggests  that  these  boundaries  are  X  X  ...  ,  X  ,  which  are  found 

nl  n2  „n  ' 

n,2 

when  the  possible  values  of 


(3.2.17) 


C'/I-l)[c1  +  c2("V2)  +  ...  +  cn(^/2)rl"1]  , 


where  c^  ( i  =  1,  2,  . . .  ;  n)  is  +  1  ,  are  ranked  in  order  of  magnitude. 

If  f  (x)  is  known  for  T  .. (X  .  , )  <  x  <  T  . (X  . )  and  T  _(X  .  ,)  <  x  < 
n'  nl  n;i-l  nl'  ni7  n2'  n;i-l7 

Tnp(Xm'  )  t  equation  (3.2. 13)  gives  f  (x)  for  X  <  x  <X  .  Since 

nc  ni  n  n;i-i  m 

f^(x)  is  known  [see  equation  (3.2. 15)]  for  -1  =  X^  <  x  <  X^  =  -3  +  2(*V2); 

equation  (3.2. 13)  can  be  applied  repeatedly  to  give  an  explicit  solution  for 

f  (x)  (-1  <  x  <  l)  provided  that  the  values  of  X  .  ,  T  , (X  . )  ,  and 
n  7  r  m  1  nl'  ni7  ’ 

Tn2(xni)  are  known  i  =  1,  2;  ...  ;  2°  . 

However;  it  has  not  been  possible  to  determine  the  X^(i=l;2;  ...;  2n ) 

for  general  n  .  For  a  given  value  of  n  ,  they  may  be  found  by  using;  as  a 

guide,  decimal  approximations  of  the  quantities  given  by  (3. 1.17).  From  these 

results,  T  . (X  ,)  and  T  0(X  )  (i  =  1,  2,  ...  ;  2n)  together  with  their 

7  nl  nl  n2  ni 

decimal  approximations  are  easily  determined.  With  the  aid  of  these 

approximations;  f^(x)  tken  found  by  means  of  equation  (3.2.13).  The 

method  is  illustrated;  in  the  follwoing  discussion;  by  the  calculation  of 

f^(x)  ;  f^(x)  )  and  fj_(x)  [Since  fn(x)  i-s  a  symmetric  function  these 

computations  are  carried  out  for  -1  <  x  <  X  .  (n  =  3)  5)].  Decimal 

n;2n  +1 

approximations  for  xn^>  T^(X^)  anc*  ^n2^ni^  be  denoted  by  , 

D  .  .  and  D  _  .  respectively  (n  =  3;  ^  r  >  i  =  1,  2  ,  . .  ,  2n  Vl)  . 

n 1 ;  1  n2 ; x 


* 
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(i)  n  =  3.  In  this  case  a  =  1/n^2  and  equation  (3.2,13) 

becomes 

(3.2.18)  fj(x)  =  S2/^fg(— )  -  f5[x+2(l-  3'a)]  (-oo<  x  <oo)  , 

=0  (x  <  -1,  X  >  1)  . 


By  equation  (3.2, 15), 


(3.2.19) 


f5(x) 


— l^)2 

32 (  ?2-l)5 


(-1  <  x  <  -3  +  2(3/2))  . 


Table  3. 1  gives  some  preliminary  calculations  which  are  necessary  in  the 


determination  of 


f5(x) 


by  the  method  discussed  above. 


i 

X3i 

A3i 

D.,  . 

31;  1 

°32,i 

1 

-1 

-1 

-1 

2 

-34 a(  ^2) 

-0,48o 

-0. 587 

-1 

3 

-l-2(  n/2)+2(3/2)2 

-0.345 

-0.480 

-0.865 

4 

3-2(  \^2)2 

-0. 175 

-0.345 

-0.695 

5 

OJ 

§ 

$ 

1 

+0. 175 

-0,068 

-0.345 

Table  3,3  Values  of  X_.  together  with  the  decimal  approximations,  A_.  , 

3i  31 

D_-  .  and  D_0  .  computed  for  X  ,  X  (X, . )  and  T_0(X  )  , 

31,1  32,:  r  3i  31  3i  32v  31 

respectively  (i  =  l,2,3>^-;5). 


Using  equations  (3.2.18)  and  (3.2.19)  and  the  results  given  in  Table 


3.1,  it  follows  that 


. 

:  :  - 

£?(x)  =  (x+1)2/32(^2-1)3 


(-1  <  X  <  -3  +  2(3n^2))  , 


(3.2.20) 


'32(4.03  t^+l]2-[x+5-2(^2)]2) 


1 

32(3/2-1)* 


1 

32(5n/2-1)5 


(-3+2(^/2)  <  x  <  -1  -2(5/2)+2(3n/2)2)  , 
([k+1]2-[x+3-2(?/2)  ]2-[x+1+2(5/2)-2(5n/2)2]2} 
(-1-2(5n/2)+2(5n/2)2  <  x  <  3  -2(5n/2)2) 

{[x+1]2-[x+3-2(5n/2)]2-[x+1+2(:V2)-2(5n/2)2]  2 

-[x-3+2(^/2)2]2) 


(3-2(5n/2)2  <  x  <  -3  +  2(  V2)2) 


(ii) 

become 

n  =  4.  Here  a  =  l/Vi  .  Equations  (3,2,13)  and  (3,2.15) 

(3.2.20 

f^(x)  -  25//  f^( - — )  -  f^[x+2(l-2  7  )]  (-cd<  x  <00 

=0  (x  <  -1,  x  >  l) 

and 

(3.2.22) 

=  C4(x+1)5  , 

respectively, 

where 

l/6(21/^-l)i 


Table  3.2  is  the  counterpart  of  Table  3.1  and  contains  results  necessary  in 
the  calculation  of  fi  (x)  , 
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i 

xu 

A4i 

Vi 

°42,i 

1 

-1 

-1 

-1 

2 

-3+2(21/4i) 

-0.622 

-0.682 

-1 

3 

-l-2(21//^)42(21/2) 

-0.550 

-0.622 

-0.928 

4 

-l-2(21/2^+2(25/i|) 

-0.465 

-0.550 

-0.842 

5 

3-2  (2^) 

-0.364 

-0.465 

-0.742 

6 

-3+2C21/2) 

-0. 172 

-0.304 

-0.550 

7 

-3+2  ( 2 )  -2  ( 2 l/2  )+2  (2^ ) 

-0,086 

-0.232 

-0.465 

8 

-l-2(21/^)+2(25/^) 

-0.015 

-0. 172 

-0.392 

9 

l+2(21/^)-2(25/4) 

+0.015 

-0.146 

-0.362 

Table  3.2 


Values  of  X,  . 

A.  .  .  D,  -  .  and  1D 
4i'  4l,i 

respectively  (i  = 


together  with  the  decimal  approximations 

42,1  o£  V  W  and  W  ’ 


1,  2, 


Thus, 

=  0^(x+l)^  (“1  <  x  <  X  )  , 

-  (x+1  )^-(x-X^2  )^}  <  X  ^  ^43^  * 

x 

=  C^{(x+l)^  •-  ^  (x-X^^^)^)  <  x  <  X}[ }[  )  , 

i=2 

x 

=  C4((x+1)5  -  J  (x-X4i)3)  (X44  <  x  <  x45)  > 

1=2 

X 

-  C4;(x+1)3  -  7  (*-*Ju>3J  V  <  x  <  V  ’ 

i=2 


.  - 

' 

. 

. 

i. . .  » 3  <  X  »  1 ) 

'  « 

J  I 
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X 

C4((x+1)3  -)  +  (x-X46)5} 

iT2 


<Xltf  <  X  <  V' 


X  X 

C4{(x+1)5-^  (x-X4i)5+^  (x-X4.)5} 

i=2  i=6 


(X4?  <  x  <  X48), 


^  8, 

C4((x+1)5  (x-X4.)5+^  (x-X4.)5} 

i=2  i=6 


(X48  <  x  <  X^  ), 


where  =  l/oCV^-l)^  211/2 

are  given  in  Table  3.2. 


.  The  quantities  X4i(i  =  1,  2,  . . 9) 


In  summary, 


V*)  =  C4{(k+1)3  -)  (x-X^.)3)  (X4i  <  x  <  XM+1) 


j=2 


(3.2.23) 


(i  =  1,  2,  3,  5), 


A  .  JL, 

Vx)  =  C4[(x+1)3  (x-X4.)3+  )  (x-X4.)3)  (Xu  <  x  <  X4  ,+1) 

j=2  j=6 

(i  =  6,  7,  8). 


(iii)  n  =  5  .  When  n  =  5  x  a  =  1/  and  equations  (3.2.  13) 

/ 

and  (3.2.I5)  become 

1  /5 

(3.2.24)  f_(x)  =  2V5f  (2il2+i)  _  f  [x+2(  1-21/5)  ]  (-oo<  x  <oo), 


'5V  pl/5 


=  0 


(x  <  -1,  x  >  l) 


and 


f_(x)  =  Cj(x+1) 


(3.2.25) 


(-1  <  x  <  -3  +  2(21/'3))  , 


a.  to  : 
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respectively,  where  =  1/jji l  (  il 2 - 1 ) '2  f\ 
necessary  in  the  calculation  of  f^x)  . 


Table  3.3  gives  some  results 


i 

X5i 

V 

14-1  . 

5M 

°52,i 

1 

-1 

-1 

-1 

2 

-3+2(21/5) 

-0.702 

-0.71a 

-1 

3 

-l-2(21/5)+2(22/5) 

-0.660 

-0.702 

-0.958 

4 

-l-2(22/5)+2(25/5) 

-0.606 

-0. 660 

-0.904 

5 

-l-2(23/5)^(2V5) 

-0.550 

-0. 606 

-0.848 

6 

3-2 (2^/5) 

-0.482 

-0.550 

-0.780 

7 

-3+2(25/9) 

-0.362 

-0.445 

-0.660 

8 

-  342  ( 2  )  -  2  ( 22/"' ) +2  ( 2  ? ) 

-0.308 

-0.  398 

-0.606 

9 

-l-2(21/5)4-2(2^/5) 

-0.266 

-0.362 

-0.564 

10 

-^+2(21/^)-2  >5/5  Vo  (2^/5  > 

-0.252 

-0.349 

-O.55O 

11 

/  1  /*5  ,  p  /*5.  ^  ,  4  /  ’“  \ 

—  '  )  Ti—  \  t-  '  )  ~~c-  \  C.  j  ~rt—  \  <—  J 

-0.210 

-0.308 

-0. 508 

12 

l42(21/^)-2(2V5) 

-0. 184 

-0.290 

-0.482 

1 J 

-l-2(22/5)+2(24//5) 

-0. 156 

-0.266 

-0.454 

12 

3-2(2L/:;  )+2  (22/5 )  -2  (24/5 ) 

-0. 142 

-0.252 

-0.440 

1> 

3-2  ( 22/5  )+2  ( 25/5 )  -2  ( 24/^ ) 

-0.084 

-0.210 

-0. 382 

lo 

3-2(25/5) 

-0.032 

-O.I56 

-0.330 

J-7 

-3+2(23/5) 

0.032 

-0. 102 

-0.266 

Table  3.3  Values  of  X 


A5i'  D5l,i' 

respectively 


^  together  with  the  decimal  approximations, 
and  D._0  .  of  X  , 

32,i  %’ 

(i  =  1,  2,  . . . )  17 ) • 


w 


and 


W' 
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By  the  same  method  as  that  used  in  the  cases,  n  =  3  and  ^  >  it 


may  be  shown  that 


(3.2.26) 


f5(x)  =  C5{(x+1)4  -  )  (x-X^)4)  (X5.  <  x  <  X5  l+1) 


j=2 


(i  =  1,  2,  . . . ,  6), 


c 

o 


=  c^{(x+l)4  -)_  (x-X^.)4  +  ^  (x-X5j)4) 


j=2 


j=T 


(X,,.  <  x  <  X,.  .  ,  ) 
v  5i  5,i+l' 


(i  =  1,Q,  . . . ,  16), 


where  Cj.  =  l/^:(5/2-l)5  27].  The  Xt_i(i  =  1,  2,  ...  ,17)  are  given  in 
Table  3.3. 

The  results  obtained  for  the  cases  n  =  ^,  k,  and  3  suggest  the 


c  on  j  ec  t  ur  e 


fk(x)  =  Ck{(x+l)k-1-  £  (x-X^-1)  (Xk.  <  x  <  Xk  i+1) 

j=2 

(i  =  1,  2,  . . . ,  k+1) 


k+1 


Ck((x+1) 


k-1 


j=2  j=k+2 


<  x  < 


xi  •  i) 

k, i+lx 


(i  =  k+2, 


2k-l) 


for  k  >  1 


where  Ck  =  l/£(k-l);  (k>/2-l)k  25k_1/2]. 


The  probability  density  functions,  f-^x)  and  f2(x),  were  obtained 


75 


e 


Figure  3. I  Probability  density  function 
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by  McGregor  and  Hui  [10]  pages  1264,  1286,  These  functions  and 

ft  (x),  and  f_(x)  are  illustrated  in  Figure  3.1. 

4  p 


f^(x), 


The  appearance  of  these  curves  suggests  that  as  n  -»  °o  ,  f^(x) 

tends  to  the  normal  probability  density  function.  That  this  is  so,  is  shown 
in  Section  Three, 


SECTION  THREE 

It  has  been  shown  [see  equation  (3.1.4)]  that,  for  =  a 

and  n1  =  rt0  =  z  ,  the  characteristic  function  <5  (t)  satisfies 

1  X 


(3.5.1) 


®v(t)  =  ®  (at)  cos [ t( 1-a) ]  . 


Let  z  = 


x-p 


,  where  p  and  cr  are  the  mean  and  standard 


deviation,  respectively,  of  the  distribution  of  x  .  Then,  since  0  (t) 

X 


is  even,  p  =  0  ,  =  cr2  =  [by  equation  (3.I.I9)],  and  z  =  x(~-]f  . 


1+a; 


If  $  (t)  =  E(e1Zt)  denotes  the  characteristic  function  of  the 
z 


distribution  of  z  , 


<J>  (t)  =  E 

z 


•  x-  (  l+a'\2  _ 

ixt(- — —  )  ^ 
v  1-a' 


=  ®  ] 
x  1-a 


or 


*x<‘> 


Thus,  by  equation  (3.3. l); 


®  f  t.(-r-— )2  ] 

ZL  vl-pa 


<J>zCat('X7f)2  3  cos  [t(i~a)] 


4 


, 
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or 


(3.3.2) 


P  ~ 

0  (t)  =  <t>  (at)  cos  [ t(  1-a  )z  ] 

o 


Since  0  (t)  is  an  even  function  of  t  , 

X 


oo 


(3.3.3) 


\  '  /  i^rt2r 

^(t)  =  ^2r ,  z,a  (2r):  ' 


r=0 


2r 


where  P2r  2  a  =  E(zcl )  .  For  convenience,  ^2r  z  a  and  ngr  z  ^(=  lim  H 


<x  — ■ +1 


2r , z,a 


) 


will  be  denoted  by  and  [± ^  ,  respectively.  With  this  notation, 

(3.3.2)  and  (3.3.3)  imply 


00 


(-l)rt2r 
^2r  (2r )] 


r=0 


00 


L-l£S2*X 

2r  (2r): 


00 


(-l)kC2k(l-a2)k 

(2k]1 


r=C 


k=0 


00  00 


r=0k=0 


/  i\r+K«2r/-  2\k 

n  (-1)  a  (1-a  ) 
^2r  (2r)!  (2k).’ 


2r+2k 


00 


•_1'  c  {}_  ta2r-2k(2r-2k) .*  (2k)j 
k=0 


2r-2k/ .  2Nk 
a  ( 1-a 


r=0 


This  implies  that 


^2r  =  (2r)'  /  4 


„2r-2k,,  2  .k 

a  ( 1-a-  ) 


2r-2k  (2r-2k).'(2k)T 


k=0 


=  (2r)i  ^ 


k=l 


a2r-2k( i^q2  )k 
2r"2k  (2r-2k).'(2k).’(l-a2r) 


which,  in  turn  gives 


- 


-  78  - 


M ' 
K2r 


2r~2k+l/ ,  2A~X 

=  lira  M2r  =  lira  (2r).*  )  H  — - - LkiLJ 

a-*l  2  a  1  2  (2r-2k).'(2k).,rac:  1 


=  (2r-l)  p 


2r-2 


r-1 

[I  (2r  -  2k  -  1)  . 

k=0 


P 


2r 


Thus,  77T"y,~  =  1/2  rl  .  The  characteristic  function  obtained  by  letting 

N1—  *■  )  • 

a  — >  1  for  the  distribution  of  z  ,  is  therefore, 


00 


o  r 

(-t  /2) 


r=0 


=  e 


-t2/2 


Since  e 


-t2/2 


is  the  characteristic  function  of  the  standardized  normal 


distribution,  it  follows  that  the  distribution  function  corresponding  to 
<J>  (t)  tends  to  the  distribution  function  of  the  normal  law  as  a  ->  1  . 


SECTION  FOUR 


Consider  the  Bush-Mosteller  learning  process  described  in  Section 

Three  of  Chapter  I.  This  is  a  process  with  t  experimenter-controlled  events, 

E^,  E2;  •  •  •  >  Et  wklck  occur  with  probabilities,  jt^,  ...,  rtt,  respectively, 

t 

It  is  assumed  that  £  jt .  =  1  ,  and  that  whenever  E.  occurs  the  new 

i=l  1  1 

probability  of  response  is  Q.p  =  o^p  +  a^  ( i  =  1,  2,  .  , . ,  t )  if  the 

previous  probability  of  this  response  was  p  .  The  probability  of  response 
on  trial  n  ,  which  we  denoted  by  p^  ,  ha6  any  one  of  tn  possible 
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values,  pni  (  i  =  1,  2,  tn)  ,  where  P[pR  =  PniJ  =  ?ni  . 

Pn0 

Let  m^(0)  =  E(e  )  denote  the  moment  generating  function  of 

the  distribution  of  p  .  Then, 

n  7 


n 


m  (0)  = 

n 


0p  . 

111  t> 

e  P  . 

ni 


i=l 


Since  Pn+1  is  QjPni  with  probability  rt^P^  (i  =  1,  2,  . tn  ; 
j  =  1^  2,  . . . j  t), 


m  (0) 

n+1 


t  t 


n 


(a.+a.  p  .  )e 

e  J  J  ni  *.P  . 

J  m 


j=l  i=l 


t  tn 

— i  a  .0  r  r-i  a.  p  .0 

>  V  J  {  >  •  J  ni  ^ 


j=l 


i=l 


or 


(3.M) 


a  .0 


,  (0)  =  />  m  (a .0) 

n+1  '  J  n'  j 

j  =  l 


This  result  is  derived  by  Bush  and  Mosteller  [6]  page  562 .  Although  it  has 
not  been  shown  that  the  limiting  distribution  of  response  probabilities  exist 
for  this  learning  process,  a  remark  made  by  Karlin  [8]  page  727  suggests  that 
the  proof  of  its  existence  follows  from  arguments  similar  to  those  given  in 
the  case  t  =  2  .  It  will  be  assumed  that  this  limiting  distribution  does 
exist.  On  letting  n  — >  °o  in  equation  (3.4.1)  we  obtain 

'r — 1  a  .  0 

(3.4.2)  m(0)  =  2  *je  J  m(aj0)  ; 

j=l 
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where  m(0)  =  lim  m  (d)  , 

n 

n  -+oo 


When  =  a  =  , . ,  =  .  a.  ,  equation  (3.4,2)  becomes 


(3.4.3) 


a  .9 


ro 


( 9 )  =  m(a 9)  ^  e 

j  =  l 


This  implies  that 


aa  .9 


m 


(a, 0)  =  m(a20)  ^  e  * 

j  =  l 


so  that 


By  continuing  this  procedure  iteratively,  we  obtain 


(3A.k) 


The  proof  of  the  convergence  of  this  product  for  a  <  1  is  similar  to  that 
supplied  in  the  case  t  =  2. 


Let  a.  =  (1~cl)7\.  ,  where  A.  is  the  fixed  point  of  the  operator 
1  1  &  1 


Q j  ( j  —  U 

...,  t).  Then, 

00 

(3A5) 

m(0)  =  {]  ■ 

i=l 

(1  ;•  1  } 

j=l 


If  $(#)  =  m(i@)  ,  where  i  =  -1,  is  the  characteristic  function  of  the 
limiting  distribution  of  response  probabilities,  is  given  by 


. 


81 


(3.4.6) 


He) 


i(l-a)0A  a-*'1  - 

tc  e 

r 


Suppose  that  A^  <  A^  <  ...  <  A^  and  that 

A  -  A 

\  =  \  +  ^r"1^  (~t~~l~  ^  (r  =  1,  2,  t)  . 


Then, 

(3.4.7) 


00 


1.(0)  =  [] 

j=l  L 


A  -A. 


7tr  exp[i0(l-a)aj"1{A1+  ( r-1 j  ] 


r=l 


Let  p  be  the  random  variable  of  the  limiting  distribution  of 
response  probabilities.  By  the  Trapping  Theorem  (see  [l]  page  99)  > 

P[pC  I]  =  0  where  I  is  any  interval  disjoint  from  the  interval  joining 
the  limit  points  A^  and  A^_  .  The  random  variable  ,  x  ,  given  by 


x  = 

is  distributed  between 
characteristic  function 


of 


2p 

TT 


A 


and  1.  Moreover , 
the  distribution  of 


(\  /  V  7 

if  ®x(0)  =  E(e10X) 

x  , 


is  the 


®  (0) 

x' 


A  t  +  ^  i 

exp[ig(^ - — )] 

1  “  t 


Thus, 

(3.4.8)  ®(fl)  =  exp[|2  (At  +A1)]  *x[-  §  (*!  -  A ) ]  . 

By  equations  (j.^.7)  and 


. 

- 


82  - 


V-  !(*i  -  \)i  ■  “pt#  *  Vi 


A  -A. 


exp[i$(  l-cc)a^~1{x^+(r-l)(— — -1) }  ] 


or 


V9)  =  exp[lS(^ 


)] 


00 

% 


n  exp 


_gl9  (y  (?\ !+(r- !)(“——■) } 


J-l 


W 


r=l 


1  Mt 


At  +  A1 

exp[i0(- - — -)] 

A1  "  At 


oo  r 

.  ,  A. +A, 

<H  lexp 

i) 

/  Jt  exp 

j=l  L 

L  A1  At  -J 

L —  L 

-i0(l-a)a  j”1 


(t±k2r} 

^  t-1  ' 


Since, 


00 

exp 

j=l 


j  1  A  f+A  -j 

-i0(l-a)aJ  1(=-^Ti) 

V\  j 


=  exp 


A  +Ai 


00 


(3>.9)  $x(0)  =  H  Jtrexp 


j=l 


r=l 


THE  CASE  a  =  l/Vt  ,  n  =  1,  2,  . .  . 


If  t  is  odd,  that  is,  t  =  2m  +  1,  where  m  is  some  non-negative 


integer,  equation  (3.^.9)  becomes 


- 


4 


-  8J  - 


oo  -  2m+l 

(3.4.10)  ®x(d)  =  T  |  ^  Jtr  exp[-i0(l-a)a^"1(m+l-r)/m]| 

r=l 


00 


m 


=  IT.  T  *,  -+ 


j=i 


ITH-1 


i0(l-a)aj’1(r/tn) 

it  ,e  '  +  rt 

m+r+1  m-r+j. 


r=l 


x  e 


-i0(l-a)a^  ^(r 


On  the  other  hand,  if  t  is  even,  that  is,  t  =  2m,  where  m  is  a  positive 
integer,  equation  (3.4.9)  becomes 


2m 


(3.4.11)  $  (0) 


CU  /”  f  ■  |  ■> 

II  {  )  *r  exp[-18(l-a)aJ-1(2,"2+  \  ~  2r)] 

j  =  l  ^  — 


r=l 


00 

-  n 


m 


j=l 


-i@(l-a)aj'1(2r-l/2m-l) 


it  .  e 
m-r+1 


r=l 


m+r 


x  e 


i0(l-a)aJ~  (2r-l/2m-l) 


Suppose  jt  =  -  ,  r  =  1,  2,  ...,  t  ,  Then,  from  equations  (3.4.10) 
and  (3.4.11), 


00 


m 


<  A 


2m+l 


1+2  cos{#(  l-a)ai  ^(r/m)} 
r=l 


(t=2tn+l,  m  >  0) 


(3>.12)  4>x(0)  ={ 


m 

00  , — 1 

^  fl  ro  !)  c°s(  @(  l—oo  )g.  ^(2r-l/2m-l)  (t=2m,  m  >  l)  . 

i=l 


r=l 


(0  <  0  t/+m2rj)  o(i>-i)c}»03  ^S*J  -  ■  [} 
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It  is  shown  in  Appendix  I  that 


CO 

ML  ssi 


m 


J-l 


1+2  y  cos{2r0/(2m+l) J } 
r=l 


(3.4.13)  if  J 


CO 


ID 


1 


1 

m 


cos { (2r  -  1  )0/(2ro)j)  (m  *  1,  2,  ...  )  . 


r=l 


By  means  of  these  identities,  equations  (3.4.12),  can  be  greatly  simplified 
for  the  case  a-  =  1/vt  .  Suppose  t  =  2m  +  l(  m  >  0)  ,  Then,  the  first 
of  equations  (3.4.12)  with  cl  =  1/  \/ 2m+ 1  becomes 


00 


m 


®x(0)  =  []  ~Y  1+2  )  cos^(I^2rD+1  -l)^2m+l)"j(2r/2ro)) 
j=l  r=l 


or 


®  ( 


2m  0 


x  n 


00 


m 


„  )  -n 

v2m+l  -1  j  =  l 


2m+l 


1+2  2  cos{2r0(IV2m+l)"J) 
r=l 


n  00 

=n  n 


k=l  j=l 

By  the  first  of  identities  , 

2m  9 


2m+l 


m 


U&)  coJ(2-n>fl)9(.V2^i)-k(2r); 

^  (2m  +  l)j 

r=l 


0 


x 


-nN/2i^+T  -  1 


[|  sin[(2m+l)Q(rV2n>+l)  k] 

k=l  [(2m+l)0(rV2ro+l)"k] 


Thus,  for  this  case, 

/,  l  a)  $  (Q)  =  Pf  sin[(2m+l)0(IV2m+l  - 

X  k=l  [  (2m+l  )@(n\/2m+l  -  l)/2m(n\/2m+l  )k] 
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If  t  =  2m  (m  >  l),  the  second  of  equations  (3,4.12),  with  a  =  l/lV2ra  , 
becomes 


or 


00 

Me)  =  IT  - 

x '  7  .1 I.  m 


j  =  l 


)_  cos[e(  -Dcv^r-s  |^i] 

r=l 


m 

\  cos[@(IV2m)  ^(2r-l)] 
r=l 


n 


m 


r=l 


(2.)J 


By  the  second  of  identities  (3,4.13) 


0  r (2m-l )9 _  rT  sin[2m<9/(IV2m)k3 
X  ‘V/S-l  "  [2m 9/C>j2n)k] 


k=l 


so  that 

(3>.15) 


n 


<D 


x 


(e)  =  n 


k=l 


sin[2mQ(  Vi^-l)/(2m-l)(H/2m)k] 

[2m0(n\/2ni-l  )//(2m-l )(  rV2m  )k ] 


From  equations  (3.4,14)  and  (3.4.15)  it  follows  that  for  a  = 


(t  >  1), 


(3.4.16) 


cD  (9) 

x 


n 

k=l 


sin[et(rV/t~l)/(t-l)(n\/T)^] 

[9t(rVt-i)/(t-1)(ivr)k] 


A  FUNCTIONAL  EQUATION 

It  was  shown  [see  equation  (3.4.3)]  that  m (O)  ,  the  moment 


generating  function  of  the  limiting  distribution  of  response  probabilities 
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in  the  case  of  t  experimenter  controlled  events,  satisfies 

t 


(3^.17) 


m((9)  =  m(cc@) 


9(  1-a  . 


tk  .e 
J 


J 


j  =  l 


when  =  a  ss  #  =  a  .  Let  p  be  the  random  variable  of  the 

limiting  distribution  of  response  probabilities.  Then,  if 
<  ...  <  >  the  quantity 


x  = 


xi  +  \ 
xi  *  xt  xi  \ 


2P 


(At/Ax) 


is  distributed  on  (-1,  l).  If  $(?)  =  ni(i0)  is  the  characteristic  function 
of  the  distribution  of  p,  $(@)  satisfies,  by  equation  (3.4.17), 


(3.4.18) 


i0(l-a)A  . 

<&(9)  =  $(a 0)  )  jt  e  J 


j=l 


and,  by  equation  (3.4.8), 


(3.4.19) 


4>(S)  =  e  2  0x[-  |  (A1  -  At)] 


where  $  ( t )  is  the  characteristic  function  of  the  distribution  of  x 
x 


Combining  equations  (3.4.18)  and  (3.4.19);  we  obtain 


id 


r(W  0 

y-f(VV] 


e  T(XrXt)]2."je 

j=l 
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or 


®  (9) 
x 


=  e 


A,+A 

V\ 


$  (gu9  ) 

X 


-2i0(l-a)Ay(X  rAt) 


jt  .e 
J 


j=X 


Thus, 


(3,4.20)  $„(0)  =  $  (a0)  /  jt.  exp[-i(l-a)@(A  -2A  .+A,  )A  -AJ]  . 

**■  ■“  J  L  j  1  '  t  1 


j  =  l 


Let  F(x)  be  the  distribution  function  for  the  variable,  x 
Then,  by  equation  (3.1.8), 


r  0 

(3.4.21)  F(k)  =  f(o)  +  ^  J 


00  .  -ixt 

1  -  e 


it 


$  (t)  dt 
x 


(~  Go  <  X  <  Oo) 


and,  by  equation  (3.1.9); 


(3.4.22)  F(2-±-i)  -  F(i)  .  ^  J  ■  'A1"'-  •  e'lta  •  ^(at)  dt  , 


where  a  is  any  constant. 


From  these  results  and  equations  (3.4  20), 


A'  r  x+((l-a)(*  -2A  +x  )/(A  -A.)) 
F(x)-F(0)-)  «j[f(. - i 

j=l 


a 


) 


(l-a)(A  -2A  +A  )A  -A,\ 

-p( - c  A  1  '  ) 


(.-oo< 


X 


But 


( 1-a) (A  -2A  +A. )  _i  +  i  _a 

[-1+  - =r - r — - - ]/a  <  - —  =  -1  and,  since  F(-l)  =  0, 

\  •  A 


<0 


'  . 


( 
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(3.4.23) 


A  x+((l-a)(A.-2A.+A.)A  -A,)) 

F(k)  =  )  *.F(, - ■  -Ca  -J..1  C  1  )  (-«<  x  <«,) 


j=l 


V\ 


If  =  ^1  +  ( )  (j  =  1;  2,  .  ..,  t)  equation  (3.4.23)  becomes 


(3.4.24) 


F(x )  =  ^  Tt  1  "a  ^  l~?-(  j  "1-)y(  t~ 1 ) )  )  (_oo<  X  <oo). 


j=l 


If  a  =  1 /n*/t  ,  jt  .  =  j  (j  =  1,  2,  .  ..,  t)  and  n  and  t  are 

J  t 

given ,  (3.4.16)  and  (3.4.24)  can  be  used  to  determine  the  probability  density 

function,  say  f  (x)  corresponding  to  $  ( 9 )  by  the  same  method  as  that 

n ,  t  x  v 

developed  in  Section  Two  of  this  chapter  for  the  case  t  =  2  .  For  example, 
suppose  n=2  and  t  =  3  .  In  this  case,  • 


sin[|(3V3)]  sin[j(-/3-l)] 

V0)  -  ir^-p -  • 

x  1(3^3) 


■  9, 


|  (^-D 


(3.4.25) 


V3(n/3-1)2  9 2 


[cos[0(2-,'/3)]  -  cos0] 


and,  by  equation  (3.4.24),  fn  _(x)  satisfies,  for  -<*><  x  <  oo  , 

2;  3 


JS  f2;3(x)  =  f2,3(^  x  +^3-1)  +  £2;3(^3  x)  +  f2  J-fS  X  -  •/>!) 


or  alternatively, 


(3.4.26)  f2  ,(x)  =  -f}  fg)3(x  "j}+  h  -  f2)5(x  -  -T}  +  l)-f2)3(x  +  2  -  a/3)  . 


*  ) 


r  co 


00 
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fo  *(x)  =  0  for  X  <  -1  or  x  >  1.  Using  (3.4.24),  f_  _(x)  for  -1  <  x  <  1 

-  ~  2,3 

is  also  given  by 


f2  ,(*)  =  7T-—  p 

2,5  '/3('/3 -l)2 


p  00 

—  /  cos  X0[cos[0(2-V3)]  -  cos0]/0^  d0 

Tt  ^0 


[  Jcos(e(x-2-n/3)  ]  cos[0(x+2V3)j 

?  /  ]  jg  +  o 

'*  L  0  0 


cos [a(x+l )  ]  cos  [0(x-l  )  ] 


d0 


a 


a 


Thus, 


df2f3(x) 

dx 


*J3('J~3- 1  )2«  ^ 


00 


sin[0(x+l) ] 

a 


00 


da  + 


sin[a(x-l) ] 


0 


a 


da 


00 


sin[a(x-2-^/3)  ]  f  ”sin[a(x+2V3)  ] 


0 


a 


o 


a 


da 


Using  equation  (3.2.9); 

(3.4.27) 


df2>> 


dx 


for  -1  <  x  <  -2  +  and, 


since  x 


-  +  1  <  -1  and  x  +  2  -  2/5  <  -1  , 


(3.*K  28) 


f2  ^(x)  =  ^  f2  *(*7^)  > 

2’3  2>3  41 


by  equation  (3.4.26).  Equation  (3.4.28)  has  as  its  solution 


V  n,  -  .  ...  V  -  - 


■ 
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fQ  *(x)  =  c(x+l), 

where  0  =  1/  sfjirJj- 1  in  order  that  the  coefficient  of  x  be  the  same  as 
that  given  by  (3.4.27). 

When  -2+-V3<x<2-</3,  -1<  <  -2  +4 ,  -1  <  x  -  Jj,  +  1  <  -2  +  •/? 

r-  ^ 

and  x  +  2-2\/3<-l.  Thus, 

f2  3^  =  (x+1)  '  (x+2_n^)^  (-2  +  n/3  <  x  <  2  -  n/3)  . 

When  2-\/5<x<-3  +  2/3  ,  -2  +  n/J  <  *  ~  ^  +  1  <  2  -  «/3  , 

V3 

-l<x-*/3"+l<-2+'/3  and  x+2  -  2/J  <  -1  .  Thus, 

f„  ,M  =  C{  (x+1 )  -  (X-2+/3)  -  (x+2^)}  (2  -  v/3  <  x  <  -3  +  S/3). 

3 


When  -3  +  2/3  <  x  <  1  ,  -2  +  < 


x 


V3+1 


^3 


<2  , 


-2  +  J3  <  x  -  n/3  +  1  <  2  -  sf3  and  -1  <  x  +  2  -  2J3  <  -2  +  \^3.  Thus, 


f0  ,(x)  =  C  { ( x+ 1 )  -  (X-2+/3)  -  (x+2^[J)  +  (x+3-2/3)  -  (x+3-2/3)} 
=  C{  (x+1 )  -  (X-2+/3)  -  (x+2^3)}  (-3W3  <  X  <  1). 


In  summary, 


f2  5(x)  =  •:  •  •  "  2 

2,3  vfWB-i)2 

1 

•JjbJj-i) 

1— X 


(-1  <  X  <  -2  +  n/3), 

(-2  +  n/3  <  x  <  2  a/3), 
(2  -  Vj  <  x  <  1). 


s 


-  91  - 


BIBLIOGRAPHY 


[1]  Bush,  Robert  R.  and  Hosteller,  Frederick.  Stochastic  models  for  learning. 

New  York!  John  Wiley  and  Sons  Inc.,  1955. 

[2]  Bush,  Robert  R.  "A  survey  of  learning  theory."  Developments  in  mathe¬ 

matical  psychology  (edited  by  R.D.  Luce).  Glencoe, 
Illinois:  The  Free  Press,  i960,  123-125. 

[3]  Bush,  Robert  R.  and  Estes,  William  K.  Studies  in  mathematical  learning 

theory.  Stanford,  California:  Stanford  University  Press, 

1959. 


[4]  Bush,  Robert  R.  and  Mosteller,  Frederick.  "A  mathematical  model  for 

simple  learning."  Psychological  Review  ,  1951*  58,  313_323« 

[3]  Bush,  Robert  R.  and  Mosteller,  Frederick.  "A  model  for  stimulus  general¬ 
ization  and  discrimination."  Psychological  Review  ,  1951 > 
58,  413-423. 

[6]  Bush,  Robert  R.  and  Mosteller,  Frederick.  "A  stochastic  model  with 

application  to  learning."  The  Annals  of  Mathematical 

Statistics,  1953.  24,  559-585. 


[7]  Bush,  Robert  R.,  Mosteller,  Frederick,  and  Thompson,  G.L,  "A  formal 

structure  for  multiple  choice  situations."  Decision 
processes  (edited  by  R.M.  Thrall,  C.H.  Coombs,  and 
R.L.  Davis),  New  York:  John  Wiley  and  Sons  Inc.,  1954, 
99-126. 


... 

1 


-  92  - 


[8]  Karlin,  S.  "Some  random  walks  arising  in  learning  models  I."  Pacific 

Journal  of  Mathematics,  1953,  3»  725~756. 

[9]  Kemeny,  John  G.  and  Snell,  J.  Laurie.  "Markov  processes  in  learning 

theory."  Psychometrika ,  1957,  22,  221-230. 

[10]  McGregor,  John  R.  and  Hui,  Y.Y.  "Limiting  distributions  associated  with 

certain  stochastic  learning  models."  The  Annals  of 
Mathematical  Statistics,  1962,  33,  1281-1285. 

E.C.  The  theory  of  functions.  Oxford  University  Press,  1939. 

"On  convergent  Poisson  Processes."  American  Journal  of 
Mathematics,  I935,  57,  835-838. 

Statistical  independence  in  probability  analysis  and 

number  theory.  New  York:  Cams  Mathematical  Monograph. 
The  Mathematical  Association  of  America,  1959* 

et  al.  Bateman  Manuscript  Project.  California  Institute  of 
Technology.  Table  of  integral  transforms.  New  York: 
McGraw-Hill,  195^- • 

A  short  table  of  integrals.  Boston:  Ginn  and  Co.,  1956. 

[16]  Jessen,  B.  and  Wintner,  A.  "Distribution  functions  and  the  Riemann  Zeta 

F unc t ion . "  Transactions  of  the  American  Mathematical 
Society,  1935,  38,  61. 

[17]  Kershner,  R.  and  Wintner,  A.  "On  a  family  of  symmetric  Bernoulli  Convol¬ 

utions."  American  Journal  of  Mathematics,  1935,  57 >  5^6- 

5^7. 


[11]  Titchmarsh, 

[12]  Wintner,  A. 


[13]  Kac,  Mark 


[14]  Erdelyi,  A. 


r  l  q  1  PiPirp  r  n 


-  93  - 


[18]  Erdos,  P.  "On  a  family  of  symmetric  Bernoulli  Convolutions.” 

American  Journal  of  Mathematics,  1939 »  61,  976. 

[19]  Kendall,  M.G.  and  Stuart,  A.  The  advanced  theory  of  statistics,  Volume  I. 

London:  C.  Griffin  and  Co.,  1958. 

[20]  Pierpont,  J.  The  theory  of  functions  of  real  variables,  Volume  I. 

Boston,  Ginn  and  Co.,  1905* 


•  • 

t£ 


-  9^  - 


APPENDIX  I 


Since 


and 


m 


sin(0/2m+l) 


1  +  2  y  cos(2r0/2m+l) 

L _ i 

r=l 


=  sin(0/2m+l)  + 


=  sin0  (m  >  0) 


ra 


r=l 


sin[(2r+l)0/2m+l]-sin[  (2r-l)0/2m+l] 


rn 


2sin(@/2m)  ^  cos [  (2r-l)0/2m] 


r=l 


m 


r=l 


s  in  ( 2r  0/2m  )  -  s  in  [  ( 2r  -  2 )  0/2m  ] 


=  sin0(m  >  l)  , 


it  follows  that 


r 


sin0 

0 


sin(0/2m+l) 

( 0/2m+ 1 ) 


m 

-1+2  cos [2r0/2m+l)- 


2m+l 


= 


m 


£"~(’0^ln')  Y  (cos[(2r-l)0/2m]/m) 


r=l 


(m=l , 2, . . . )  . 


Thus , 


sin0 

0 


m 


sinf 9/(2m+l j  3  TT  1 

[e/(an+i)s]  j=1  2m+1  1 

o  2  ra 

■inLe/laaU  n  I  V  cos[(2r.1)9/(2m)j] 


1  +  2  2^  Cos[2r0/(2m+l) J  ] 
r=  1 


[e/(2mr)  j=i 


r=l 


( ni— 1,2, ...  )  • 


' 


. 


stage , 


9K 


Continuing  this  procedure  iteratively,  we  obtain,  at  the  ri 


th 


Now, 


m 


sin0 

0 


sin[0/(2nH-l)n]  TT  _ 1 

,  n  i  i  1  2m+ 1 


[ 0/ ( 2m+ 1 ) U ]  j  =  i 


1  +  2  y  cos[2r0/(2m+l)'}] 
r=l 


n  m 

iM§zu'™irj  n  -  y 

te/(an)n]  j-i  m  r4i 


cos  [  (2r-  l)e/(2m)h 


(m=l,2, 


lin,  .  Um  ^ .  1  (m=l  ,2, . . . ) . 

[0/(2m+l)n]  n  -+oo  [0/(2m)n] 


n  — ►  00 


Thus , 


sin0 

0 


00 


A 


=  < 


2m+l 


m 


m 

1  +  2  ^  cos[2r0/(2m+l)^ ] 

r=l 


n  “j  Y  cos[(2r-l)0/(2m)j ] 


j=l  r=l 


( m=  1 , 2 ,  .  .  .  ) 


/ 
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